FIELD AND BACKGROUND OF THE INVENTION

The present invention relates, in general, to a biotechnological method for
production of (3S,3'S) astaxanthin. In particular, the present invention relates
to a peptide having a β-C-4-oxygenase activity; a DNA segment coding for this
peptide; an RNA segment coding for this peptide; a recombinant DNA molecule
comprising a vector and the DNA segment; a host cell or organism containing the
above-described recombinant DNA molecule or DNA segment; and to a method of
biotechnologically producing (3S,3'S) astaxanthin, or a food additive containing
(3S,3'S) astaxanthin, using the host.

Carotenoids, such as astaxanthin, are natural pigments that are responsible 
for many of the yellow, orange and red colors seen in living organisms. 
Carotenoids are widely distributed in nature and have, in various living systems, 
two main biological functions: they serve as light-harvesting pigments in
photosynthesis, and they protect against photooxidative damage. These and
additional biological functions of carotenoids, their important industrial role, and 
their biosynthesis are discussed hereinbelow. 

As part of the light-harvesting antenna, carotenoids can absorb photons and 
transfer the energy to chlorophyll, thus assisting in the harvesting of light in the
range of 450-570 nm [see, Cogdell RJ and Frank HA (1987) How carotenoids
function in photosynthetic bacteria. Biochim Biophys Acta 895: 63-79; Cogdell R
(1988) The function of pigments in chloroplasts. In: Goodwin TW (ed) Plant 
Pigments, pp 183-255. Academic Press, London; Frank HA, Violette CA, 
Trautman JK, Shreve AP, Owens TG and Albrecht AC (1991) Carotenoids in
photosynthesis: structure and photochemistry. Pure Appl Chem 63: 109-114; Frank
HA, Farhoosh R, Decoster B and Christensen RL (1992) Molecular features that 
control the efficiency of carotenoid-to-chlorophyll energy transfer in 
photosynthesis. In: Murata N (ed) Research in Photosynthesis, Vol I, pp 125-128. 
Kluwer, Dordrecht; and, Cogdell RJ and Gardiner AT (1993) Functions of
carotenoids in photosynthesis. Meth Enzymol 214: 185-193]. Although
carotenoids are integral constituents of the protein-pigment complexes of the light- 
harvesting antennae in photosynthetic organisms, they are also important 
components of the photosynthetic reaction centers. 
Most of the carotenoid content is located in the light-harvesting complex II
[Bassi R, Pineau B, Dainese P and Marquardt J (1993) Carotenoid binding proteins
of photosystem II. Eur J Biochem 212: 297-302]. The identities of the
photosynthetically active carotenoproteins and their precise location in
light-harvesting systems are not known. Carotenoids in photochemically active
chlorophyll-protein complexes of the thermophilic cyanobacterium Synechococcus
sp. were investigated by linear dichroism spectroscopy of oriented samples [see,
Breton J and Kato S (1987) Orientation of the pigments in photosystem II: low-
temperature linear-dichroism study of a core particle and of its chlorophyll-protein
subunits isolated from Synechococcus sp. Biochim Biophys Acta 892: 99-107].
These complexes contained mainly a β-carotene pool absorbing around 505 and
470 nm, which is oriented close to the membrane plane. In photochemically
inactive chlorophyll-protein complexes, the β-carotene absorbs around 495 and
465 nm, and the molecules are oriented perpendicular to the membrane plane.

Evidence that carotenoids are associated with cyanobacterial photosystem
(PS) II has been described [see, Suzuki R and Fujita Y (1977) Carotenoid
photobleaching induced by the action of photosynthetic reaction center II: DCMU
sensitivity. Plant Cell Physiol 18: 625-631; and, Newman PJ and Sherman LA
(1978) Isolation and characterization of photosystem I and II membrane particles
from the blue-green alga Synechococcus cedrorum. Biochim Biophys Acta 503:
343-361]. There are two β-carotene molecules in the reaction center core of PS II
[see, Ohno T, Satoh K and Katoh S (1986) Chemical composition of purified
oxygen-evolving complexes from the thermophilic cyanobacterium Synechococcus
sp. Biochim Biophys Acta 852: 1-8; Gounaris K, Chapman DJ and Barber J (1989)
Isolation and characterization of a D1/D2/cytochrome b-559 complex from
Synechocystis PCC6803. Biochim Biophys Acta 973: 296-301; and, Newell RW,
van Amerongen H, Barber J and van Grondelle R (1993) Spectroscopic
characterization of the reaction center of photosystem II using polarized light:
Evidence for β-carotene excitons in PS II reaction centers. Biochim Biophys Acta
1057: 232-238] whose exact function(s) is still obscure [reviewed by Satoh K
(1992) Structure and function of PS II reaction center. In: Murata N (ed) Research
in Photosynthesis, Vol. II, pp. 3-12. Kluwer, Dordrecht]. It was demonstrated that
these two coupled β-carotene molecules protect chlorophyll P680 from
photodamage in isolated PS II reaction centers [see, De Las Rivas J, Telfer A and
Barber J (1993) 2-coupled β-carotene molecules protect P680 from photodamage
in isolated PS II reaction centers. Biochim Biophys Acta 1142: 155-164], and this
may be related to the protection against degradation of the D1 subunit of PS II
[see, Sandmann G (1993) Genes and enzymes involved in the desaturation 
reactions from phytoene to lycopene. (abstract), 10th International Symposium on
Carotenoids, Trondheim CL1-2]. The light-harvesting pigments of a highly
purified, oxygen-evolving PS II complex of the thermophilic cyanobacterium
Synechococcus sp. consist of 50 chlorophyll a and 7 β-carotene, but no
xanthophyll, molecules [see, Ohno T, Satoh K and Katoh S (1986) Chemical
composition of purified oxygen-evolving complexes from the thermophilic
cyanobacterium Synechococcus sp. Biochim Biophys Acta 852: 1-8]. β-Carotene
was shown to play a role in the assembly of an active PS II in green algae [see,
Humbeck K, Romer S and Senger H (1989) Evidence for the essential role of
carotenoids in the assembly of an active PS II. Planta 179: 242-250].

Isolated complexes of PS I from Phormidium luridum, which contained 40
chlorophylls per P700, contained an average of 1.3 molecules of β-carotene [see,
Thornber JP, Alberte RS, Hunter FA, Shiozawa JA and Kan KS (1976) The
organization of chlorophyll in the plant photosynthetic unit. Brookhaven Symp
Biology 28: 132-148]. In a preparation of PS I particles from Synechococcus sp.
strain PCC 6301, which contained 130 ± 5 molecules of antenna chlorophylls per
P700, 16 molecules of carotenoids were detected [see, Lundell DJ, Glazer AN,
Melis A and Malkin R (1985) Characterization of a cyanobacterial photosystem I
complex. J Biol Chem 260: 646-654]. A substantial content of β-carotene and the
xanthophylls cryptoxanthin and isocryptoxanthin was detected in PS I pigment-
protein complexes of the thermophilic cyanobacterium Synechococcus elongatus
[see, Coufal J, Hladik J and Sofrova D (1989) The carotenoid content of
photosystem I pigment-protein complexes of the cyanobacterium Synechococcus
elongatus. Photosynthetica 23: 603-616]. A subunit protein-complex structure of
PS I from the thermophilic cyanobacterium Synechococcus sp., which consisted of
four polypeptides (of 62, 60, 14 and 10 kDa), contained approximately 10
β-carotene molecules per P700 [see, Takahashi Y, Hirota K and Katoh S (1985)
Multiple forms of P700-chlorophyll a-protein complexes from Synechococcus sp.:
the iron, quinone and carotenoid contents. Photosynth Res 6: 183-192]. This
carotenoid is exclusively bound to the large polypeptides which carry the
functional and antenna chlorophyll a. The fluorescence excitation spectrum of
these complexes suggested that β-carotene serves as an efficient antenna for PS I.

As mentioned, an additional essential function of carotenoids is to protect
against photooxidation processes in the photosynthetic apparatus that are caused
by the excited triplet state of chlorophyll. Carotenoid molecules with π-electron
conjugation of nine or more carbon-carbon double bonds can absorb triplet-state
energy from chlorophyll and thus prevent the formation of harmful singlet-state
oxygen radicals. In Synechococcus sp. the triplet state of carotenoids was
monitored in closed PS II centers, and its rise kinetics of approximately 25
nanoseconds is attributed to energy transfer from chlorophyll triplets in the antenna
[see, Schlodder E and Brettel K (1988) Primary charge separation in closed
photosystem II with a lifetime of 11 nanoseconds. Flash-absorption spectroscopy
with oxygen-evolving photosystem II complexes from Synechococcus. Biochim
Biophys Acta 933: 22-34]. It is conceivable that this process, which has a lower
yield than radical-pair formation, plays a role in protecting
chlorophyll from damage due to over-excitation.

The protective role of carotenoids in vivo has been elucidated through the
use of bleaching herbicides such as norflurazon that inhibit carotenoid biosynthesis
in all organisms performing oxygenic photosynthesis [reviewed by Sandmann G
and Boger P (1989) Inhibition of carotenoid biosynthesis by herbicides. In: Boger
P and Sandmann G (Eds.) Target Sites of Herbicide Action, pp 25-44. CRC Press,
Boca Raton, Florida]. Treatment with norflurazon in the light results in a decrease
of both carotenoid and chlorophyll levels, while in the dark, chlorophyll levels are
unaffected. Inhibition of photosynthetic efficiency in cells of Oscillatoria agardhii
that were treated with the pyridinone herbicide, fluridone, was attributed to a
decrease in the relative abundance of myxoxanthophyll, zeaxanthin and
β-carotene, which in turn caused photooxidation of chlorophyll molecules [see,
Canto de Loura I, Dubacq JP and Thomas JC (1987) The effects of nitrogen
deficiency on pigments and lipids of cyanobacteria. Plant Physiol 83: 838-843].

It has been demonstrated in plants that zeaxanthin is required to dissipate, in
a nonradiative manner, the excess excitation energy of the antenna chlorophyll
[see, Demmig-Adams B (1990) Carotenoids and photoprotection in plants: a role
for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24; and, Demmig-
Adams B and Adams WW III (1990) The carotenoid zeaxanthin and high-energy-
state quenching of chlorophyll fluorescence. Photosynth Res 25: 187-197]. In
algae and plants, a light-induced deepoxidation of violaxanthin to yield zeaxanthin
is related to photoprotection processes [reviewed by Demmig-Adams B and
Adams WW III (1992) Photoprotection and other responses of plants to high light
stress. Ann Rev Plant Physiol Plant Mol Biol 43: 599-626]. The light-induced
deepoxidation of violaxanthin and the reverse reaction that takes place in the dark
are known as the "xanthophyll cycle" [see, Demmig-Adams B and Adams WW III
(1992) Photoprotection and other responses of plants to high light stress. Ann Rev
Plant Physiol Plant Mol Biol 43: 599-626]. Cyanobacterial lichens, which do not
contain any zeaxanthin and probably are incapable of radiationless energy
dissipation, are sensitive to high light intensity; algal lichens that contain
zeaxanthin are more resistant to high-light stress [see, Demmig-Adams B, Adams
WW III, Green TGA, Czygan FC and Lange OL (1990) Differences in the
susceptibility to light stress in two lichens forming a phycosymbiodeme, one
partner possessing and one lacking the xanthophyll cycle. Oecologia 84: 451-456;
Demmig-Adams B and Adams WW III (1993) The xanthophyll cycle, protein
turnover, and the high light tolerance of sun-acclimated leaves. Plant Physiol 103:
1413-1420; and, Demmig-Adams B (1990) Carotenoids and photoprotection in
plants: a role for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24].
In contrast to algae and plants, cyanobacteria do not have a xanthophyll cycle.
However, they do contain ample quantities of zeaxanthin and other xanthophylls
that can support photoprotection of chlorophyll.

Several other functions have been ascribed to carotenoids. The possibility
that carotenoids protect against damaging species generated by near ultra-violet
(UV) irradiation is suggested by results describing the accumulation of β-carotene
in a UV-resistant mutant of the cyanobacterium Gloeocapsa alpicola [see, Buckley
CE and Houghton JA (1976) A study of the effects of near UV radiation on the
pigmentation of the blue-green alga Gloeocapsa alpicola. Arch Microbiol 107: 93-
97]. This has been demonstrated more elegantly in Escherichia coli cells that
produce carotenoids [see, Tuveson RW and Sandmann G (1993) Protection by
cloned carotenoid genes expressed in Escherichia coli against phototoxic
molecules activated by near-ultraviolet light. Meth Enzymol 214: 323-330]. Due
to their ability to quench oxygen radical species, carotenoids are efficient
antioxidants and thereby protect cells from oxidative damage. This function of
carotenoids is important in virtually all organisms [see, Krinsky NI (1989)
Antioxidant functions of carotenoids. Free Radical Biol Med 7: 617-635; and,
Palozza P and Krinsky NI (1992) Antioxidant effects of carotenoids in vivo and in
vitro - an overview. Meth Enzymol 213: 403-420]. Other cellular functions could
be affected by carotenoids, even if indirectly. Although carotenoids in
cyanobacteria are not the major photoreceptors for phototaxis, an influence of
carotenoids on phototactic reactions that has been observed in Anabaena
variabilis was attributed to the removal of singlet oxygen radicals that may act as
signal intermediates in this system [see, Nultsch W and Schuchart H (1985) A
model of the phototactic reaction chain of cyanobacterium Anabaena variabilis.
Arch Microbiol 142: 180-184].

In flowers and fruits carotenoids facilitate the attraction of pollinators and
dispersal of seeds. This latter aspect is strongly associated with agriculture. The
type and degree of pigmentation in fruits and flowers are among the most
important traits of many crops, mainly because the colors of these products
often determine their appeal to consumers and thus can increase their market
worth.

Carotenoids have important commercial uses as coloring agents in the food 
industry since they are non-toxic [see, Bauernfeind JC (1981) Carotenoids as 
colorants and vitamin A precursors. Academic Press, London]. The red color of 
the tomato fruit is provided by lycopene which accumulates during fruit ripening 
in chromoplasts. Tomato extracts, which contain a high content (over 80% dry
weight) of lycopene, are commercially produced worldwide for industrial use as
food colorant. Furthermore, the flesh, feathers or eggs of fish and birds assume the
color of the dietary carotenoid provided, and thus carotenoids are frequently used
in dietary additives for poultry and in aquaculture. Certain cyanobacterial species,
for example Spirulina sp. [see, Sommer TR, Potts WT and Morrissy NM (1990)
Recent progress in processed microalgae in aquaculture. Hydrobiologia 204/205:
435-443], are cultivated in aquaculture for the production of animal and human
food supplements. Consequently, the content of carotenoids, primarily of
β-carotene, in these cyanobacteria has a major commercial implication in
biotechnology.

Most carotenoids are composed of a C40 hydrocarbon backbone, 
constructed from eight C5 isoprenoid units and contain a series of conjugated 
double bonds. Carotenes do not contain oxygen atoms and are either linear or 
cyclized molecules containing one or two end rings. Xanthophylls are oxygenated 
derivatives of carotenes. Various glycosylated carotenoids and carotenoid esters
have been identified. The C40 backbone can be further extended to give C45 or
C50 carotenoids, or shortened yielding apocarotenoids. Some nonphotosynthetic
bacteria also synthesize C30 carotenoids. General background on carotenoids can
be found in Goodwin TW (1980) The Biochemistry of the Carotenoids, Vol. 1, 2nd 
Ed. Chapman and Hall, New York; and in Goodwin TW and Britton G (1988) 
Distribution and analysis of carotenoids. In: Goodwin TW (ed) Plant Pigments, pp 
62-132. Academic Press, New York. 

More than 640 different naturally-occurring carotenoids have been characterized
so far; carotenoids are responsible for most of the various shades of
yellow, orange and red found in microorganisms, fungi, algae, plants and animals.
Carotenoids are synthesized by all photosynthetic organisms as well as several
nonphotosynthetic bacteria and fungi; however, they are also widely distributed
through feeding throughout the animal kingdom.

Carotenoids are synthesized de novo from isoprenoid precursors only in
photosynthetic organisms and some microorganisms; they typically accumulate in
protein complexes in the photosynthetic membrane, in the cell membrane and in
the cell wall.

WO 98/18910    PCT/US97/17819

As detailed in Figure 1, in the biosynthesis pathway of β-carotene, four
enzymes convert geranylgeranyl pyrophosphate of the central isoprenoid pathway
to β-carotene. Carotenoids are produced from the general isoprenoid biosynthetic
pathway. While this pathway has been known for several decades, only recently,
and mainly through the use of genetics and molecular biology, have some of the
molecular mechanisms involved in carotenoid biogenesis been elucidated. This
is due to the fact that most of the enzymes which take part in the conversion of
phytoene to carotenes and xanthophylls are labile, membrane-associated proteins
that lose activity upon solubilization [see, Beyer P, Weiss G and Kleinig H (1985)
Solubilization and reconstitution of the membrane-bound carotenogenic enzymes
from daffodil chromoplasts. Eur J Biochem 153: 341-346; and, Bramley PM
(1985) The in vitro biosynthesis of carotenoids. Adv Lipid Res 21: 243-279].
However, solubilization of carotenogenic enzymes from Synechocystis sp. strain
PCC 6714 that retain partial activity has been reported [see, Bramley PM and
Sandmann G (1987) Solubilization of carotenogenic enzyme of Aphanocapsa.
Phytochem 26: 1935-1939]. There is no genuine in vitro system for carotenoid
biosynthesis which enables a direct assay of enzymatic activities. A cell-free
carotenogenic system has been developed [see, Clarke IE, Sandmann G, Bramley
PM and Boger P (1982) Carotene biosynthesis with isolated photosynthetic
membranes. FEBS Lett 140: 203-206] and adapted for cyanobacteria [see,
Sandmann G and Bramley PM (1985) Carotenoid biosynthesis by Aphanocapsa
homogenates coupled to a phytoene-generating system from Phycomyces
blakesleeanus. Planta 164: 259-263; and, Bramley PM and Sandmann G (1985) In
vitro and in vivo biosynthesis of xanthophylls by the cyanobacterium
Aphanocapsa. Phytochem 24: 2919-2922]. Reconstitution of phytoene desaturase
from Synechococcus sp. strain PCC 7942 in liposomes was achieved following
purification of the polypeptide, which had been expressed in Escherichia coli [see,
Fraser PD, Linden H and Sandmann G (1993) Purification and reactivation of
recombinant Synechococcus phytoene desaturase from an overexpressing strain of
Escherichia coli. Biochem J 291: 687-692].

Referring now to Figure 1, carotenoids are synthesized from isoprenoid
precursors. The central pathway of isoprenoid biosynthesis may be viewed as
beginning with the conversion of acetyl-CoA to mevalonic acid. Δ3-isopentenyl
pyrophosphate (IPP), a C5 molecule, is formed from mevalonate and is the
building block for all long-chain isoprenoids. Following isomerization of IPP to
dimethylallyl pyrophosphate (DMAPP), three additional molecules of IPP are
combined to yield the C20 molecule, geranylgeranyl pyrophosphate (GGPP).
These 1'-4 condensation reactions are catalyzed by prenyl transferases [see,
Kleinig H (1989) The role of plastids in isoprenoid biosynthesis. Ann Rev Plant
Physiol Plant Mol Biol 40: 39-59]. There is evidence in plants that the same
enzyme, GGPP synthase, carries out all the reactions from DMAPP to GGPP [see,
Dogbo O and Camara B (1987) Purification of isopentenyl pyrophosphate
isomerase and geranylgeranyl pyrophosphate synthase from Capsicum
chromoplasts by affinity chromatography. Biochim Biophys Acta 920: 140-148;
and, Laferriere A and Beyer P (1991) Purification of geranylgeranyl diphosphate
synthase from Sinapis alba etioplasts. Biochim Biophys Acta 216: 156-163].
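
The carbon bookkeeping of the chain elongation described above can be checked with a short sketch. This is only an illustration of the arithmetic stated in the text (a C5 starter, DMAPP, extended by three C5 IPP units to the C20 molecule GGPP); the function and variable names are hypothetical and are not taken from the specification.

```python
# Illustrative carbon count for the prenyl transferase steps described above.
# Assumption: each condensation adds exactly one C5 unit (IPP) to the chain.
C5_UNIT = 5  # carbon atoms in IPP or DMAPP, as stated in the text


def elongate(starter_carbons, ipp_additions):
    """Return the chain length after each successive IPP condensation."""
    lengths = [starter_carbons]
    for _ in range(ipp_additions):
        lengths.append(lengths[-1] + C5_UNIT)
    return lengths


# DMAPP (C5) plus three additional IPP molecules gives GGPP.
chain_lengths = elongate(C5_UNIT, 3)
ggpp_carbons = chain_lengths[-1]

print(chain_lengths)   # [5, 10, 15, 20]
print(ggpp_carbons)    # 20
```

The intermediate lengths of 10 and 15 carbons are simply what the arithmetic yields; the text itself names only the C5 and C20 species.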

The first step that is specific for carotenoid biosynthesis is the head-to-head
condensation of two molecules of GGPP to produce prephytoene pyrophosphate
(PPPP). Following removal of the pyrophosphate, GGPP is converted to 15-cis-
phytoene, a colorless C40 hydrocarbon molecule. This two-step reaction is
catalyzed by the soluble enzyme, phytoene synthase, an enzyme encoded by a
single gene (crtB), in both cyanobacteria and plants [see, Chamovitz D, Misawa N,
Sandmann G and Hirschberg J (1992) Molecular cloning and expression in
Escherichia coli of a cyanobacterial gene coding for phytoene synthase, a
carotenoid biosynthesis enzyme. FEBS Lett 296: 305-310; Ray JA, Bird CR,
Maunders M, Grierson D and Schuch W (1987) Sequence of pTOM5, a ripening
related cDNA from tomato. Nucl Acids Res 15: 10587-10588; Camara B (1993)
Plant phytoene synthase complex - component enzymes, immunology, and
biogenesis. Meth Enzymol 214: 352-365]. All the subsequent steps in the pathway
occur in membranes. Four desaturation (dehydrogenation) reactions convert
phytoene to lycopene via phytofluene, ζ-carotene, and neurosporene. Each
desaturation increases the number of conjugated double bonds by two such that
the number of conjugated double bonds increases from three in phytoene to eleven
in lycopene.
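
As a quick consistency check on the numbers in this paragraph, the following sketch tabulates the conjugated double bonds at each stage of the desaturation series. It assumes only what the text states (three conjugated double bonds in phytoene, two more per desaturation, four desaturations); the identifiers are hypothetical, and "zeta-carotene" is written out in ASCII for convenience.

```python
# Illustrative tally of conjugated double bonds along the desaturation series.
# Assumptions taken from the text: phytoene starts with 3 conjugated double
# bonds and each of the four desaturations adds 2.
DESATURATION_SERIES = [
    "phytoene",
    "phytofluene",
    "zeta-carotene",
    "neurosporene",
    "lycopene",
]
START_BONDS = 3
BONDS_PER_DESATURATION = 2


def conjugated_double_bonds():
    """Map each intermediate to its number of conjugated double bonds."""
    return {
        name: START_BONDS + BONDS_PER_DESATURATION * step
        for step, name in enumerate(DESATURATION_SERIES)
    }


bonds = conjugated_double_bonds()
for name, count in bonds.items():
    print(f"{name}: {count}")

# Two C20 GGPP molecules condense head-to-head to give the C40 backbone.
phytoene_carbons = 2 * 20
```

Running the sketch gives eleven conjugated double bonds for lycopene, in agreement with the figure quoted in the paragraph.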

Relatively little is known about the molecular mechanism of the enzymatic
dehydrogenation of phytoene [see, Jones BL and Porter JW (1986) Biosynthesis of
carotenes in higher plants. CRC Crit Rev Plant Sci 3: 295-324; and, Beyer P,
Mayer M and Kleinig H (1989) Molecular oxygen and the state of geometric
isomerism of intermediates are essential in the carotene desaturation and
cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150]. It
has been established that in cyanobacteria, algae and plants the first two
desaturations, from 15-cis-phytoene to ζ-carotene, are catalyzed by a single
membrane-bound enzyme, phytoene desaturase [see, Jones BL and Porter JW
(1986) Biosynthesis of carotenes in higher plants. CRC Crit Rev Plant Sci 3: 295-
324; and, Beyer P, Mayer M and Kleinig H (1989) Molecular oxygen and the state
of geometric isomerism of intermediates are essential in the carotene desaturation
and cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150].
Since the ζ-carotene product is mostly in the all-trans configuration, a cis-trans
isomerization is presumed at this desaturation step. The primary structure of the
phytoene desaturase polypeptide in cyanobacteria is conserved (over 65% identical
residues) with that of algae and plants [see, Pecker I, Chamovitz D, Linden H,
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966; Pecker I, Chamovitz D,
Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene
in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18.
Kluwer, Dordrecht]. Moreover, the same inhibitors block phytoene desaturase in
the two systems [see, Sandmann G and Boger P (1989) Inhibition of carotenoid
biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target Sites of
Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida]. Consequently, it is
very likely that the enzymes catalyzing the desaturation of phytoene and
phytofluene in cyanobacteria and plants have similar biochemical and molecular
properties, which are distinct from those of phytoene desaturases in other
microorganisms. One such difference is that phytoene desaturases from
Rhodobacter capsulatus, Erwinia sp. or fungi convert phytoene to neurosporene,
lycopene, or 3,4-dehydrolycopene, respectively.
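
The organism-specific end products listed above can be summarized as a small lookup, which makes the contrast between the "plant-type" enzyme and the other desaturases easier to see. This is a sketch built only from the products named in the text; the dictionary layout and function name are hypothetical. The number of desaturation steps is derived from the product's position in the phytoene-to-lycopene series, and is left undefined for 3,4-dehydrolycopene because the text does not place it in that series.

```python
# Illustrative lookup of phytoene desaturase end products by source organism,
# using only the products named in the text.
SERIES = ["phytoene", "phytofluene", "zeta-carotene", "neurosporene", "lycopene"]

DESATURASE_PRODUCT = {
    "cyanobacteria, algae and plants": "zeta-carotene",
    "Rhodobacter capsulatus": "neurosporene",
    "Erwinia sp.": "lycopene",
    "fungi": "3,4-dehydrolycopene",
}


def desaturation_steps(source):
    """Steps from phytoene to the enzyme's product, or None if the product
    lies outside the phytoene-to-lycopene series given in the text."""
    product = DESATURASE_PRODUCT[source]
    if product not in SERIES:
        return None
    return SERIES.index(product)


for source, product in DESATURASE_PRODUCT.items():
    print(f"{source}: {product} ({desaturation_steps(source)} steps)")
```

For the cyanobacterial and plant enzyme the lookup returns two steps, matching the statement that the first two desaturations are catalyzed by a single phytoene desaturase.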

Desaturation of phytoene in daffodil chromoplasts [see, Beyer P, Mayer M
and Kleinig H (1989) Molecular oxygen and the state of geometric isomerism of
intermediates are essential in the carotene desaturation and cyclization reactions in
daffodil chromoplasts. Eur J Biochem 184: 141-150], as well as in a cell free
system of Synechococcus sp. strain PCC 7942 [see, Sandmann G and Kowalczyk S
(1989) In vitro carotenogenesis and characterization of the phytoene desaturase
reaction in Anacystis. Biochem Biophys Res Com 163: 916-921], is dependent on
molecular oxygen as a possible final electron acceptor, although oxygen is not
directly involved in this reaction. A dehydrogenase-electron transferase
mechanism was supported in cyanobacteria over a dehydrogenase-monooxygenase
mechanism [see, Sandmann G and Kowalczyk S (1989) In
vitro carotenogenesis and characterization of the phytoene desaturase reaction in
Anacystis. Biochem Biophys Res Com 163: 916-921]. A conserved FAD-binding
motif exists in all phytoene desaturases whose primary structures have been
analyzed [see, Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J
(1992) A single polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966; Pecker I, Chamovitz D, Mann V, Sandmann G, Boger P and
Hirschberg J (1993) Molecular characterization of carotenoid biosynthesis in
plants: the phytoene desaturase gene in tomato. In: Murata N (ed) Research in
Photosynthesis, Vol III, pp 11-18. Kluwer, Dordrecht]. The phytoene desaturase
enzyme in pepper was shown to contain a protein-bound FAD [see, Hugueney P,
Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning
of a flavoprotein catalyzing the synthesis of phytofluene and ζ-carotene in
Capsicum chromoplasts. Eur J Biochem 209: 399-407]. Since phytoene desaturase
is located in the membrane, an additional, soluble redox component is predicted.
This hypothetical component could employ NAD(P)+, as suggested [see, Mayer
MP, Nievelstein V and Beyer P (1992) Purification and characterization of a
NADPH dependent oxidoreductase from chromoplasts of Narcissus
pseudonarcissus - a redox-mediator possibly involved in carotene desaturation.
Plant Physiol Biochem 30: 389-398] or another electron and hydrogen carrier, such
as a quinone. The cellular location of phytoene desaturase in Synechocystis sp.
strain PCC 6714 and Anabaena variabilis strain ATCC 29413 was determined
with specific antibodies to be mainly (85%) in the photosynthetic thylakoid
membranes [see, Serrano A, Gimenez P, Schmidt A and Sandmann G (1990)
Immunocytochemical localization and functional determination of phytoene
desaturase in photoautotrophic prokaryotes. J Gen Microbiol 136: 2465-2469].

In cyanobacteria, algae and plants ζ-carotene is converted to lycopene via
neurosporene. Very little is known about the enzymatic mechanism, which is
predicted to be carried out by a single enzyme [see, Linden H, Vioque A and
Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for
ζ-carotene desaturase from Anabaena PCC 7120 by heterologous complementation.
FEMS Microbiol Lett 106: 99-104]. The deduced amino acid sequence of
ζ-carotene desaturase in Anabaena sp. strain PCC 7120 contains a
dinucleotide-binding motif that is similar to the one found in phytoene desaturase.

Two cyclization reactions convert lycopene to β-carotene. Evidence has
been obtained that in Synechococcus sp. strain PCC 7942 [see, Cunningham FX Jr,
Chamovitz D, Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional
expression in Escherichia coli of a cyanobacterial gene for lycopene cyclase, the
enzyme that catalyzes the biosynthesis of β-carotene. FEBS Lett 328: 130-138], as
well as in plants [see, Camara B and Dogbo O (1986) Demonstration and
solubilization of lycopene cyclase from Capsicum chromoplast membranes. Plant
Physiol 80: 172-184], these two cyclizations are catalyzed by a single enzyme,
lycopene cyclase. This membrane-bound enzyme is inhibited by the triethylamine
compounds, CPTA and MPTA [see, Sandmann G and Boger P (1989) Inhibition of
carotenoid biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target
Sites of Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida].
Cyanobacteria carry out only the β-cyclization and therefore do not contain
ε-carotene, δ-carotene and α-carotene and their oxygenated derivatives. The β-ring
is formed through the formation of a "carbonium ion" intermediate when the C-1,2
double bond at the end of the linear lycopene molecule is folded into the position
of the C-5,6 double bond, followed by a loss of a proton from C-6. No cyclic
carotene has been reported in which the 7,8 bond is not a double bond. Therefore,
full desaturation as in lycopene, or desaturation of at least half-molecule as in
neurosporene, is essential for the reaction. Cyclization of lycopene involves a
dehydrogenation reaction that does not require oxygen. The cofactor for this
reaction is unknown. A dinucleotide-binding domain was found in the lycopene
cyclase polypeptide of Synechococcus sp. strain PCC 7942, implicating NAD(P) or
FAD as coenzymes with lycopene cyclase.

The addition of various oxygen-containing side groups, such as hydroxy-,
methoxy-, oxo-, epoxy-, aldehyde or carboxylic acid moieties, forms the various
xanthophyll species. Little is known about the formation of xanthophylls.
Hydroxylation of β-carotene requires molecular oxygen in a mixed-function
oxidase reaction.

Clusters of genes encoding the enzymes for the entire pathway have been
cloned from the purple photosynthetic bacterium Rhodobacter capsulatus [see,
Armstrong GA, Alberti M, Leach F and Hearst JE (1989) Nucleotide sequence,
organization, and nature of the protein products of the carotenoid biosynthesis
gene cluster of Rhodobacter capsulatus. Mol Gen Genet 216: 254-268] and from
the nonphotosynthetic bacteria Erwinia herbicola [see, Sandmann G, Woods WS
and Tuveson RW (1990) Identification of carotenoids in Erwinia herbicola and in
transformed Escherichia coli strain. FEMS Microbiol Lett 71: 77-82; Hundle BS,
Beyer P, Kleinig H, Englert H and Hearst JE (1991) Carotenoids of Erwinia
herbicola and an Escherichia coli HB101 strain carrying the Erwinia herbicola
carotenoid gene cluster. Photochem Photobiol 54: 89-93; and, Schnurr G, Schmidt
A and Sandmann G (1991) Mapping of a carotenogenic gene cluster from Erwinia
herbicola and functional identification of six genes. FEMS Microbiol Lett 78: 157-
162] and Erwinia uredovora [see, Misawa N, Nakagawa M, Kobayashi K,
Yamano S, Izawa I, Nakamura K and Harashima K (1990) Elucidation of the
Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of
gene products in Escherichia coli. J Bacteriol 172: 6704-6712]. Two genes, al-3
for GGPP synthase [see, Nelson MA, Morelli G, Carattoli A, Romano N and
Macino G (1989) Molecular cloning of a Neurospora crassa carotenoid
biosynthetic gene (albino-3) regulated by blue light and the products of the white
collar genes. Mol Cell Biol 9: 1271-1276; and, Carattoli A, Romano N, Ballario P,
Morelli G and Macino G (1991) The Neurospora crassa carotenoid biosynthetic
gene (albino 3). J Biol Chem 266: 5854-5859] and al-1 for phytoene desaturase
[see, Schmidhauser TJ, Lauter FR, Russo VEA and Yanofsky C (1990) Cloning,
sequencing and photoregulation of al-1, a carotenoid biosynthetic gene of
Neurospora crassa. Mol Cell Biol 10: 5064-5070] have been cloned from the
fungus Neurospora crassa. However, attempts at using these genes as
heterologous molecular probes to clone the corresponding genes from
cyanobacteria or plants were unsuccessful due to lack of sufficient sequence
similarity.

The first "plant-type" genes for carotenoid synthesis enzymes were cloned
from cyanobacteria using a molecular-genetics approach. In the first step towards
cloning the gene for phytoene desaturase, a number of mutants that are resistant to
the phytoene-desaturase-specific inhibitor, norflurazon, were isolated in
Synechococcus sp. strain PCC 7942 [see, Linden H, Sandmann G, Chamovitz D,
Hirschberg J and Boger P (1990) Biochemical characterization of Synechococcus
mutants selected against the bleaching herbicide norflurazon. Pestic Biochem
Physiol 36: 46-51]. The gene conferring norflurazon-resistance was then cloned
by transforming the wild-type strain to herbicide resistance [see, Chamovitz D,
Pecker I and Hirschberg J (1991) The molecular basis of resistance to the herbicide
norflurazon. Plant Mol Biol 16: 967-974; Chamovitz D, Pecker I, Sandmann G,
Boger P and Hirschberg J (1990) Cloning a gene for norflurazon resistance in
cyanobacteria. Z Naturforsch 45c: 482-486]. Several lines of evidence indicated
that the cloned gene, formerly called pds and now named crtP, codes for phytoene
desaturase. The most definitive one was the functional expression of phytoene
desaturase activity in transformed Escherichia coli cells [see, Linden H, Misawa
N, Chamovitz D, Pecker I, Hirschberg J and Sandmann G (1991) Functional
complementation in Escherichia coli of different phytoene desaturase genes and
analysis of accumulated carotenes. Z Naturforsch 46c: 1045-1051; and, Pecker I,
Chamovitz D, Linden H, Sandmann G and Hirschberg J (1992) A single
polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. The crtP gene was also cloned from Synechocystis sp. strain
PCC 6803 by similar methods [see, Martinez-Ferez IM and Vioque A (1992)
Nucleotide sequence of the phytoene desaturase gene from Synechocystis sp. PCC
6803 and characterization of a new mutation which confers resistance to the
herbicide norflurazon. Plant Mol Biol 18: 981-983].

The cyanobacterial crtP gene was subsequently used as a molecular probe
for cloning the homologous gene from an alga [see, Pecker I, Chamovitz D, Mann
V, Sandmann G, Boger P and Hirschberg J (1993) Molecular characterization of
carotenoid biosynthesis in plants: the phytoene desaturase gene in tomato. In:
Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18. Kluwer, Dordrecht]
and higher plants [see, Bartley GE, Viitanen PV, Pecker I, Chamovitz D,
Hirschberg J and Scolnik PA (1991) Molecular cloning and expression in
photosynthetic bacteria of a soybean cDNA coding for phytoene desaturase, an
enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci USA 88:
6532-6536; and, Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J
(1992) A single polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. The phytoene desaturases in Synechococcus sp. strain PCC 7942
and Synechocystis sp. strain PCC 6803 consist of 474 and 467 amino acid residues,
respectively, whose sequences are highly conserved (74% identities and 86%
similarities). The calculated molecular mass is 51 kDa and, although it is slightly
hydrophobic (hydropathy index -0.2), it does not include a hydrophobic region
which is long enough to span a lipid bilayer membrane. The primary structure of
the cyanobacterial phytoene desaturase is highly conserved with the enzyme from
the green alga Dunaliella bardawil (61% identical and 81% similar; [see, Pecker I,
Chamovitz D, Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene
in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18.
Kluwer, Dordrecht]) and from tomato [see, Pecker I, Chamovitz D, Linden H,
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966], pepper [see, Hugueney P,
Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning
of a flavoprotein catalyzing the synthesis of phytofluene and ζ-carotene in
Capsicum chromoplasts. Eur J Biochem 209: 399-407] and soybean [see, Bartley
GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and Scolnik PA (1991)
Molecular cloning and expression in photosynthetic bacteria of a soybean cDNA
coding for phytoene desaturase, an enzyme of the carotenoid biosynthesis
pathway. Proc Natl Acad Sci USA 88: 6532-6536] (62-65% identical and ~79%
similar; [see, Chamovitz D (1993) Molecular analysis of the early steps of
carotenoid biosynthesis in cyanobacteria: Phytoene synthase and phytoene
desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]). The eukaryotic
phytoene desaturase polypeptides are larger (64 kDa); however, they are processed
during import into the plastids to mature forms whose sizes are comparable to
those of the cyanobacterial enzymes.
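
The percent-identity figures quoted above are ratios of matching positions in a pairwise sequence alignment. A minimal sketch of that calculation follows, in Python; it is an illustration only and not the procedure used to obtain the published values. The function name `percent_identity`, the `-` gap symbol, and the choice to count only columns in which neither sequence has a gap are assumptions, and published percentages may rest on a different denominator. Percent similarity would additionally require a residue scoring scheme, which is not specified here.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return percent identity between two pre-aligned sequences.

    Assumption: only alignment columns in which neither sequence
    carries a gap are counted in the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0   # columns without a gap in either sequence
    identical = 0  # compared columns holding the same residue
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Hypothetical toy alignment (not taken from any sequence in this document):
# five ungapped columns, four of them identical.
print(percent_identity("MKV-LA", "MRVQLA"))
```

Applied to a real alignment of two phytoene desaturase sequences, a function of this kind would be expected to give values of the same order as those cited, with the exact number depending on the alignment parameters and on how gaps are treated.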
There is a high degree of structural similarity in carotenoid enzymes of
Rhodobacter capsulatus, Erwinia sp. and Neurospora crassa [reviewed in
Armstrong GA, Hundle BS and Hearst JE (1993) Evolutionary conservation and
structural similarities of carotenoid biosynthesis gene products from
photosynthetic and nonphotosynthetic organisms. Meth Enzymol 214: 297-311],
including in the crtI gene product, phytoene desaturase. As indicated above, a
high degree of conservation of the primary structure of phytoene desaturases also
exists among oxygenic photosynthetic organisms. However, there is little
sequence similarity, except for the FAD binding sequences at the amino termini,
between the "plant-type" crtP gene products and the "bacterial-type" phytoene
desaturases (crtI gene products; 19-23% identities and 42-47% similarities). It has
been hypothesized that crtP and crtI are not derived from the same ancestral gene
and that they originated independently through convergent evolution [see, Pecker
I, Chamovitz D, Linden H, Sandmann G and Hirschberg J (1992) A single
polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. This hypothesis is supported by the different dehydrogenation
sequences that are catalyzed by the two types of enzymes and by their different
sensitivities to inhibitors.

Although not as definite as in the case of phytoene desaturase, a similar
distinction between cyanobacteria and plants on the one hand and other
microorganisms on the other is also seen in the structure of phytoene synthase.
The crtB gene (formerly psy) encoding phytoene synthase was identified in the
genome of Synechococcus sp. strain PCC 7942 adjacent to crtP and within the same
operon [see, Bartley GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and
Scolnik PA (1991) Molecular cloning and expression in photosynthetic bacteria of a
soybean cDNA coding for phytoene desaturase, an enzyme of the carotenoid
biosynthesis pathway. Proc Natl Acad Sci USA 88: 6532-6536]. This gene
encodes a 36-kDa polypeptide of 307 amino acids with a hydrophobic index of
-0.4. The deduced amino acid sequence of the cyanobacterial phytoene synthase is
highly conserved with the tomato phytoene synthase (57% identical and 70%
similar; [see, Ray JA, Bird CR, Maunders M, Grierson D and Schuch W (1987)
Sequence of pTOM5, a ripening related cDNA from tomato. Nucl Acids Res 15:
10587-10588]) but is less highly conserved with the crtB sequences from other
bacteria (29-32% identical and 48-50% similar with ten gaps in the alignment).
Both types of enzymes contain two conserved sequence motifs also found in
prenyl transferases from diverse organisms [see, Bartley GE, Viitanen PV, Pecker
I, Chamovitz D, Hirschberg J and Scolnik PA (1991) Molecular cloning and
expression in photosynthetic bacteria of a soybean cDNA coding for phytoene
desaturase, an enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci
USA 88: 6532-6536; Carattoli A, Romano N, Ballario P, Morelli G and Macino G
(1991) The Neurospora crassa carotenoid biosynthetic gene (albino 3). J Biol
Chem 266: 5854-5859; Armstrong GA, Hundle BS and Hearst JE (1993)
Evolutionary conservation and structural similarities of carotenoid biosynthesis
gene products from photosynthetic and nonphotosynthetic organisms. Meth
Enzymol 214: 297-311; Math SK, Hearst JE and Poulter CD (1992) The crtE gene
in Erwinia herbicola encodes geranylgeranyl diphosphate synthase. Proc Natl
Acad Sci USA 89: 6761-6764; and, Chamovitz D (1993) Molecular analysis of the
early steps of carotenoid biosynthesis in cyanobacteria: Phytoene synthase and
phytoene desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]. It is
conceivable that these regions in the polypeptide are involved in the binding and/or
removal of the pyrophosphate during the condensation of two GGPP molecules.

The crtQ gene encoding ζ-carotene desaturase (formerly zds) was cloned
from Anabaena sp. strain PCC 7120 by screening an expression library of
cyanobacterial genomic DNA in cells of Escherichia coli carrying the Erwinia sp.
crtB and crtE genes and the cyanobacterial crtP gene [see, Linden H, Vioque A
and Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for
ζ-carotene desaturase from Anabaena PCC 7120 by heterologous complementation.
FEMS Microbiol Lett 106: 99-104]. Since these Escherichia coli cells produce
ζ-carotene, brownish-red pigmented colonies that produced lycopene could be
identified on the yellowish background of cells producing ζ-carotene. The
predicted ζ-carotene desaturase from Anabaena sp. strain PCC 7120 is a 56-kDa
polypeptide which consists of 499 amino acid residues. Surprisingly, its primary
structure is not conserved with the "plant-type" (crtP gene product) phytoene
desaturases, but it has considerable sequence similarity to the bacterial-type
enzyme (crtI gene product) [see, Sandmann G (1993) Genes and enzymes
involved in the desaturation reactions from phytoene to lycopene. (abstract), 10th
International Symposium on Carotenoids, Trondheim CL1-2]. It is possible that
the cyanobacterial crtQ gene and crtI gene of other microorganisms originated in
evolution from a common ancestor.

The crtL gene for lycopene cyclase (formerly lcy) was cloned from
Synechococcus sp. strain PCC 7942 utilizing essentially the same cloning strategy
as for crtP. By using an inhibitor of lycopene cyclase, 2-(4-methylphenoxy)-
triethylamine hydrochloride (MPTA), the gene was isolated by transformation of
the wild-type to herbicide-resistance [see, Cunningham FX Jr, Chamovitz D,
Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional expression in
Escherichia coli of a cyanobacterial gene for lycopene cyclase, the enzyme that
catalyzes the biosynthesis of β-carotene. FEBS Lett 328: 130-138]. Lycopene
cyclase is the product of a single gene and catalyzes the double cyclization
reaction of lycopene to β-carotene. The crtL gene product in Synechococcus sp.
strain PCC 7942 is a 46-kDa polypeptide of 411 amino acid residues. It has no
sequence similarity to the crtY gene product (lycopene cyclase) from Erwinia
uredovora or Erwinia herbicola.

The genes for β-carotene hydroxylase (crtZ) and zeaxanthin glycosylase
(crtX) have been cloned from Erwinia herbicola [see, Hundle B, Alberti M,
Nievelstein V, Beyer P, Kleinig H, Armstrong GA, Burke DH and Hearst JE
(1994) Functional assignment of Erwinia herbicola Eho10 carotenoid genes
expressed in Escherichia coli. Mol Gen Genet 254: 406-416; Hundle BS, O'Brien
DA, Alberti M, Beyer P and Hearst JE (1992) Functional expression of zeaxanthin
glucosyltransferase from Erwinia herbicola and a proposed diphosphate binding
site. Proc Natl Acad Sci USA 89: 9321-9325] and from Erwinia uredovora [see,
Misawa N, Nakagawa M, Kobayashi K, Yamano S, Izawa I, Nakamura K and
Harashima K (1990) Elucidation of the Erwinia uredovora carotenoid biosynthetic
pathway by functional analysis of gene products in Escherichia coli. J Bacteriol
172: 6704-6712].

The ketocarotenoid astaxanthin (3,3'-dihydroxy-β,β-carotene-4,4'-dione)
was first described in aquatic crustaceans as an oxidized form of β-carotene.
Astaxanthin was later found to be very common in many marine animals and
algae. However, only a few animals can synthesize astaxanthin de novo from other
carotenoids and most of them obtain it in their food. In the plant kingdom,
astaxanthin occurs mainly in some species of cyanobacteria, algae and lichens.
However, it is also found, rarely, in petals of higher plant species [see, Goodwin
TW (1980) The Biochemistry of the carotenoids. Vol. 1. 2nd Ed, Chapman and
Hall, London and New York].

The function of astaxanthin as a powerful antioxidant in animals has been
demonstrated [see, Miki W (1991) Biological functions and activities of animal
carotenoids. Pure Appl Chem 63: 141]. Astaxanthin is a strong inhibitor of lipid
peroxidation and has been shown to play an active role in the protection of
biological membranes from oxidative injury [see, Palozza P and Krinsky NI (1992)
Antioxidant effects of carotenoids in vivo and in vitro - an overview. Methods
Enzymol 213: 403-420; and, Kurashige M, Okimasu E, Inoue M and Utsumi K
(1990) Inhibition of oxidative injury of biological membranes by astaxanthin.
Physiol Chem Phys Med NMR 22: 27]. The chemopreventive effects of
astaxanthin have also been investigated, and astaxanthin was shown to
significantly reduce the incidence of induced urinary bladder cancer in mice [see,
Tanaka T, Morishita Y, Suzui M, Kojima T, Okumura A and Mori H (1994)
Chemoprevention of mouse urinary bladder carcinogenesis by the naturally
occurring carotenoid astaxanthin. Carcinogenesis 15: 15]. It has also been
demonstrated that astaxanthin exerts immunomodulating effects by enhancing
antibody production [see, Jyonouchi H, Zhang L and Tomita Y (1993) Studies of
immunomodulating actions of carotenoids. II. Astaxanthin enhances in vitro
antibody production to T-dependent antigens without facilitating polyclonal B-cell
activation. Nutr Cancer 19: 269; and, Jyonouchi H, Hill JR, Yoshifumi T and
Good RA (1991) Studies of immunomodulating actions of carotenoids. I. Effects
of β-carotene and astaxanthin on murine lymphocyte functions and cell surface
marker expression in in-vitro culture system. Nutr Cancer 16: 93]. The complete
biomedical properties of astaxanthin remain to be elucidated, but initial results
suggest that it could play an important role in cancer and tumor prevention, as
well as eliciting a positive response from the immune system.

Astaxanthin is the principal carotenoid pigment of salmonids and shrimps
and imparts attractive pigmentation in the eggs, flesh and skin [see, Torrisen OJ,
Hardy RW, Shearer KD (1989) Pigmentation of salmonid-carotenoid deposition
and metabolism in salmonids. Crit Rev Aquatic Sci 1: 209]. The world-wide
harvest of salmon in 1991 was approximately 720,000 MT, of which 25-30%
were produced in a variety of aquaculture facilities [see, Meyers SP (1994)
Developments in world aquaculture, feed formulations, and role of carotenoids.
Pure Appl Chem 66: 1069]. This is set to increase up to 460,000 MT by the year
2000 [see, Bjorndahl T (1990) The Economics of Salmon Aquaculture. Blackwell
Scientific, Oxford, pp. 1]. The red coloration of the salmonid flesh contributes to
consumer appeal and therefore affects the price of the final product. Animals
cannot synthesize carotenoids and they acquire the pigments through the food
chain from the primary producers - marine algae and phytoplankton. Those grown
in intensive culture usually suffer from suboptimal color. Consequently,
carotenoid-containing nourishment is artificially added in aquaculture, at
considerable cost to the producer.

Astaxanthin is the most expensive commercially used carotenoid compound
(today's, 1995, market value is 2,500-3,500 $/kg). It is utilized mainly as a
nutritional supplement which provides pigmentation in a wide variety of aquatic
animals. In the Far-East it is also used for feeding poultry to yield a typical
pigmentation of chickens. It is also a desirable and effective nontoxic coloring for
the food industry and is valuable in cosmetics. Recently it was reported that
astaxanthin is a potent antioxidant in humans and thus is a desirable food additive.

Natural (3S,3'S) astaxanthin is limited in availability. It is commercially
extracted from some Crustacea species [see, Torrisen OJ, Hardy RW, Shearer KD
(1989) Pigmentation of salmonid-carotenoid deposition and metabolism in
salmonids. Crit Rev Aquatic Sci 1: 209]. The (3R,3'R) stereoisomer of
astaxanthin is produced from Phaffia [a yeast species, see, Andrewes AG, Phaff HJ
and Starr MP (1976) Carotenoids of Phaffia rhodozyma, a red-pigmented
fermenting yeast. Phytochemistry Vol. 15, pp. 1003-1007]. Synthetic astaxanthin,
comprising a 1:2:1 mixture of the (3S,3'S)-, (3S,3'R)- and (3R,3'R)-isomers, is now
manufactured by Hoffman-La Roche and sold at a high price (ca. $2,500/kg)
under the name "CAROPHYLL Pink" [see, Mayer H (1994) Reflections on
carotenoid synthesis. Pure & Appl Chem, Vol. 66, pp. 931-938]. Recently a novel
gene involved in ketocompound biosynthesis, designated crtW, was isolated from
the marine bacteria Agrobacterium aurantiacum and Alcaligenes PC-1 that
produce ketocarotenoids such as astaxanthin. When the crtW gene was introduced
into engineered Escherichia coli that accumulated β-carotene due to Erwinia
carotenogenic genes, the Escherichia coli transformants synthesized canthaxanthin,
a precursor in the synthetic pathway of astaxanthin [see, Misawa N, Kajiwara S,
Kondo K, Yokoyama A, Satomi Y, Saito T, Miki W and Ohtani T (1995)
Canthaxanthin biosynthesis by the conversion of methylene to keto groups in a
hydrocarbon β-carotene by a single gene. Biochemical and biophysical research
communications Vol. 209, pp. 867-876]. It is therefore desirable to find a
relatively inexpensive source of (3S,3'S) astaxanthin to be used as a feed
supplement in aquaculture and as a valuable chemical for various other industrial
uses.

Although astaxanthin is synthesized in a variety of bacteria, fungi and algae,
the key limitation to the use of biological systems for its production is the low
yield of, and the costly extraction methods in, these systems compared to chemical
synthesis. One way to solve these problems is to increase the productivity of
astaxanthin production in biological systems using recombinant DNA technology.
This allows for the production of astaxanthin in a genetically engineered host which,
in the case of a higher plant, is easy to grow and simple to extract. Furthermore,
production of astaxanthin in a genetically engineered host makes it possible, by
appropriate host selection, to use the astaxanthin thus produced in, for example,
aquaculture applications without the need for extraction.



There is thus a widely recognized need for, and it would be highly
advantageous to have, a nucleic acid segment which encodes β-C-4-oxygenase, the
enzyme that converts β-carotene to canthaxanthin, as well as recombinant vector
molecules comprising a nucleic acid sequence according to the invention, and host
cells or transgenic organisms transformed or transfected with these vector
molecules or DNA segment for the biotechnological production of (3S,3'S)
astaxanthin.

Other features and advantages of the invention will be apparent from the 
following description and from the claims. 

SUMMARY OF THE INVENTION

It is a general object of this invention to provide a biotechnological method
for production of (3S,3'S) astaxanthin.

It is a specific object of the invention to provide a peptide having a
β-C-4-oxygenase activity and a DNA segment coding for this peptide to enable a
biotechnological production of astaxanthin and other xanthophylls.

It is a further object of the invention to provide an RNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to the above
described peptide.

It is yet a further object of the invention to provide a recombinant DNA
molecule comprising a vector and the DNA segment as described above.

It is still a further object of the invention to provide a host cell containing 
the above described recombinant DNA molecule. 

It is another object of the invention to provide a host transgenic organism
containing the above described recombinant DNA molecule or the above described
DNA segment in its cells.

It is still another object of the invention to provide a host transgenic
organism which expresses β-C-4-oxygenase activity in chloroplasts and/or
chromoplasts-containing tissues.

It is yet another object of the invention to provide a food additive for animal
or human consumption comprising the above described host cell or transgenic
organism.

It is still another object of the invention to provide a method of producing
astaxanthin using the above described host cell or transgenic organism.

It is a further object of the invention to provide a method of producing
canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin, hydroxyechinenone,
zeaxanthin, adonirubin, and/or adonixanthin using the above described host cell or
transgenic organism.

Further objects and advantages of the present invention will be clear from
the description that follows.

In one embodiment, the present invention relates to a DNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to a
Haematococcus pluvialis crtO gene.

In a further embodiment, the present invention relates to an RNA segment
coding for a polypeptide comprising an amino acid sequence corresponding to a
Haematococcus pluvialis crtO gene.

In yet another embodiment, the present invention relates to a polypeptide
comprising an amino acid sequence corresponding to a Haematococcus pluvialis
crtO gene.

In a further embodiment, the present invention relates to a recombinant
DNA molecule comprising a vector and a DNA segment coding for a polypeptide,
corresponding to a Haematococcus pluvialis crtO gene.

In another embodiment, the present invention relates to a host cell 
containing the above described recombinant DNA molecule or DNA segment. 

In a further embodiment, the present invention relates to a host transgenic 
organism containing the above described recombinant DNA molecule or the above 
described DNA segment in its cells. 

In another embodiment, the present invention relates to a method of 
producing astaxanthin using the above described host cell or transgenic organism. 

In yet another embodiment, the present invention relates to a method of 
producing other xanthophylls. 

In still another embodiment, the present invention relates to a method of 
obtaining high expression of a transgene in plants specifically in chromoplasts- 
containing cells. 

In one further embodiment, the present invention relates to a method of 
importing a carotenoid-biosynthesis enzyme encoded by a transgene into 
chromoplasts. 

BRIEF DESCRIPTION OF THE DRAWINGS

The invention is herein described, by way of example only, with reference to
the accompanying drawings, wherein:

FIG. 1 is a general biochemical pathway of β-carotene biosynthesis, in
which pathway all molecules are depicted in an all-trans configuration, wherein
IPP is isopentenyl pyrophosphate, DMAPP is dimethylallyl pyrophosphate, GPP is
geranyl pyrophosphate, FPP is farnesyl pyrophosphate, GGPP is geranylgeranyl
pyrophosphate, and PPPP is prephytoene pyrophosphate;






WO 98/18910




PCT/US97/17819 




FIG. 2 is an identity map between the nucleotide sequence of the crtO
cDNA of the present invention (CRTOA.SEQ) and the cDNA cloned by Kajiwara
et al. (CRTOJ.SEQ) [see, Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T,
Nishio N, Nagai S and Misawa N (1995) Isolation and functional identification of
a novel cDNA for astaxanthin biosynthesis from Haematococcus pluvialis, and
astaxanthin synthesis in Escherichia coli. Plant Molec Biol 29: 343-352], using
GCG software, wherein (:) indicates identity, (-) indicates a gap and nucleotide
numbering is according to SEQ ID NO:4 for CRTOA.AMl and Kajiwara et al., for
CRTOJ.AMl;

FIG. 3 is an identity map between the amino acid sequence encoded by the
crtO cDNA of the present invention (CRTOA.AMl) and the amino acid sequence
encoded by the cDNA cloned by Kajiwara et al. (CRTOJ.AMl) [see, Kajiwara S,
Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and Misawa N (1995)
Isolation and functional identification of a novel cDNA for astaxanthin
biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in
Escherichia coli. Plant Molec Biol 29: 343-352], using GCG software, wherein
(:) indicates identity, (-) indicates a gap and amino acid numbering is according to
SEQ ID NO:4 for CRTOA.AMl and Kajiwara et al., for CRTOJ.AMl;

FIG. 4 is a schematic depiction of a pACYC184 derived plasmid designated
pBCAR, which includes the genes crtE, crtB, crtI and crtY of Erwinia herbicola,
which genes are required for production of β-carotene in Escherichia coli cells;

FIG. 5 is a schematic depiction of a pACYC184 derived plasmid designated
pZEAX, which includes the genes crtE, crtB, crtI, crtY and crtZ from Erwinia
herbicola, which genes are required for production of zeaxanthin in Escherichia
coli cells;

FIG. 6 is a schematic depiction of a pBluescript SK- derived plasmid
designated pHPK, containing a full length cDNA insert encoding a β-carotene
C-4-oxygenase enzyme from Haematococcus pluvialis, designated crtO and set forth
in SEQ ID NO:1, which cDNA was identified by color complementation of
Escherichia coli cells;

FIG. 7 is a schematic depiction of a pACYC184 derived plasmid designated
pCANTHA which was derived by inserting a 1.2 kb PstI-PstI DNA fragment,
containing the cDNA encoding the β-C-4-oxygenase from Haematococcus
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site
in the coding sequence of the crtZ gene in the plasmid pZEAX of Figure 5; this
recombinant plasmid carries the genes crtE, crtB, crtI, crtY of Erwinia herbicola
and the crtO gene of Haematococcus pluvialis, all required for production of
canthaxanthin in Escherichia coli cells;

FIG. 8 is a schematic depiction of a pACYC184 derived plasmid designated
pASTA which was derived by inserting the 1.2 kb PstI-PstI DNA fragment,
containing the cDNA of the β-C-4-oxygenase from Haematococcus pluvialis
isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site which
exists 600 bp downstream of the crtE gene in the plasmid pZEAX of Figure 5; this
recombinant plasmid carries the genes crtE, crtB, crtI, crtY, crtZ of Erwinia
herbicola and the crtO gene of Haematococcus pluvialis, all required for
production of astaxanthin in Escherichia coli cells;

FIG. 9 is a schematic depiction of a pBR328 derived plasmid designated
PAN3.5-KETO which was derived by inserting the 1.2 kb PstI-PstI DNA
fragment, containing the cDNA of the β-C-4-oxygenase from Haematococcus
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site
which exists in a β-lactamase gene in a plasmid designated pPAN35D5 [described
in Hirschberg J, Ohad N, Pecker I and Rahat A (1987) Isolation and
characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112], which carries the psbAI gene
from the cyanobacterium Synechococcus PCC7942 in the plasmid vector pBR328
[see, Hirschberg J, Ohad N, Pecker I and Rahat A (1987) Isolation and
characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112]; this recombinant plasmid carries
the crtO gene of Haematococcus pluvialis, required for production of astaxanthin
in Synechococcus PCC7942 cells;

FIG. 10 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pBIB [described by Becker D
(1990) Binary vectors which allow the exchange of plant selectable markers
and reporter genes. Nucleic Acids Research 18:230] which is a derivative of the
Ti plasmid pBI101 [described by Jefferson AR, Kavanagh TA and Bevan WM
(1987) GUS fusions: β-glucuronidase as a sensitive and versatile gene fusion
marker in higher plants. The EMBO J. 6: 3901-3907], wherein Br and Bl are
the right and left borders, respectively, of the T-DNA region, pAg7 is the
polyadenylation site of gene 7 of Agrobacterium Ti-plasmid, pAnos is a 250 bp
long DNA fragment containing the polyadenylation site of the nopaline
synthase gene of Agrobacterium, NPT II is a 1,800 bp long DNA fragment
coding for kanamycin resistance, pnos is a 300 bp long DNA fragment
containing the promoter sequence of the nopaline synthase gene of
Agrobacterium, whereas pAnos is a 300 bp long DNA fragment containing the
polyadenylation site of the nopaline synthase gene of Agrobacterium;

FIG. 11 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pPTBIB which was prepared by
cloning a genomic DNA sequence of a tomato species Lycopersicon
esculentum marked PT (nucleotides 1 to 1448 of the Pds gene as published in
Mann V, Pecker I and Hirschberg J (1994) Cloning and characterization of the
gene for phytoene desaturase (Pds) from tomato (Lycopersicon esculentum).
Plant Molecular Biology 24: 429-434), which contains the promoter of the Pds
gene and the coding sequence for the amino terminus region of the polypeptide
PDS that serves as a transit peptide for import into chloroplasts and
chromoplasts, into a HindIII-SmaI site of the binary plasmid vector pBIB of
Figure 10, wherein Br and Bl, pAg7, pAnos, NPT II, pnos and pAnos are as
defined above;

FIG. 12 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pPTCRTOBIB which was
prepared by cloning a 1,110 nucleotide long Eco47III-NcoI fragment of the
cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of SEQ ID NO:1)
into the SmaI site of the plasmid pPTBIB of Figure 11, such that the coding
nucleotide sequence of the amino terminus of PDS is in the same reading frame
as crtO, wherein Br and Bl, pAg7, pAnos, NPT II, pnos, and pAnos are as
defined above, PT is the promoter and transit peptide coding sequences of Pds
from tomato and CRTO is the nucleotide sequence of crtO from H. pluvialis
(nucleotides 211 to 1321 of SEQ ID NO:1);

FIG. 13 shows a Southern DNA blot analysis of HindIII-digested genomic
DNA extracted from wild type (WT) and crtO tobacco transgenic plants,
designated 2, 3, 4, 6, 9 and 10, according to the present invention, using the crtO
cDNA as a radioactive probe essentially as described in Sambrook et al.,
Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y. 1989, wherein the size of marker (M) DNA fragments in
kilobase pairs (kb) is indicated on the left as well as the expected position (arrow)
of an internal T-DNA HindIII fragment as was deduced from the sequence of
pPTPDSBIB shown in Figure 12 which contains the crtO cDNA sequence;

FIG. 14 shows a biosynthesis pathway of astaxanthin;

FIG. 15 shows a flower from a wild type tobacco plant and a flower from a
transgenic tobacco plant according to the present invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention is, in general, of a biotechnological method for
production of (3S,3'S) astaxanthin. In particular, the present invention is of a
peptide having a β-C-4-oxygenase activity; a DNA segment coding for this
peptide; an RNA segment coding for this peptide; a recombinant DNA molecule
comprising a vector and the DNA segment; a host cell or organism containing the
above described recombinant DNA molecule or DNA segment; and of a method
for biotechnologically producing (3S,3'S) astaxanthin or a food additive containing
(3S,3'S) astaxanthin, using the host.

The unicellular fresh-water green alga Haematococcus pluvialis
accumulates large amounts of (3S,3'S) astaxanthin when exposed to unfavorable
growth conditions, or following different environmental stresses such as phosphate
or nitrogen starvation, high concentration of salt in the growth medium or high
light intensity [see, Yong YYR and Lee YK (1991) Phycologia 30: 257-261; Droop
MR (1954) Arch Microbiol 20: 391-397; and, Andrewes AG, Borch G,
Liaaen-Jensen S and Snatzke G (1974) Acta Chem Scand B28: 730-736]. During this
process, the vegetative cells of the alga form cysts and change their color from
green to red. The present invention discloses the cloning of a cDNA from
Haematococcus pluvialis, designated crtO, which encodes a β-C-4-oxygenase, the
enzyme that converts β-carotene to canthaxanthin, and its expression in
heterologous systems expressing β-carotene hydroxylase (e.g., Erwinia herbicola
crtZ gene product), leading to the production of (3S,3'S) astaxanthin.

The crtO cDNA and its encoded peptide having a β-C-4-oxygenase activity
are novel nucleic and amino acid sequences, respectively. The cloning method of
the crtO cDNA took advantage of a strain of Escherichia coli, which was
genetically engineered to produce β-carotene, to which a cDNA library of
Haematococcus pluvialis was transfected and expressed. Visual screening for
brown-red pigmented Escherichia coli cells identified a canthaxanthin-producing
transformant. The thus-cloned cDNA has been expressed in two
heterologous systems (Escherichia coli and Synechococcus PCC7942 cells), both
able to produce β-carotene and further including an engineered (Erwinia herbicola
crtZ gene product) or endogenous β-carotene hydroxylase activity, and was shown
to enable the production of (3S,3'S) astaxanthin in both these systems.

The crtO cDNA or its protein product exhibit no meaningful nucleic- or
amino acid sequence similarities to the nucleic- or amino acid sequence of crtW
and its protein product isolated from the marine bacteria Agrobacterium
aurantiacum and Alcaligenes PC-1 that produce ketocarotenoids such as
astaxanthin [see, Misawa N, Kajiwara S, Kondo K, Yokoyama A, Satomi Y, Saito
T, Miki W and Ohtani T (1995) Canthaxanthin biosynthesis by the conversion of
methylene to keto groups in a hydrocarbon β-carotene by a single gene.
Biochemical and Biophysical Research Communications Vol. 209, pp. 867-876].

However, the crtO cDNA and its protein product exhibit substantial
nucleic- and amino acid sequence identities with the nucleic- and amino acid
sequence of a recently cloned cDNA encoding a 320 amino acid protein product
having β-carotene oxygenase activity, isolated from Haematococcus pluvialis [see,
Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and
Misawa N (1995) Isolation and functional identification of a novel cDNA for
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis
in Escherichia coli. Plant Molec Biol 29: 343-352]. Nevertheless, as presented in
Figure 2, the degree of sequence identity between the crtO cDNA (CRTOA.SEQ in
Figure 2) and the cDNA described by Kajiwara et al. (CRTOJ.SEQ in Figure 2)
[see reference above] is 75.7% and, as presented in Figure 3, the degree of
sequence identity between the crtO cDNA protein product (CRTOA.AMI in
Figure 3) and the protein described by Kajiwara et al. (CRTOJ.AMI in Figure 3) is
78%, as was determined using GCG software.

As will be described in detail hereinbelow, the crtO cDNA can thus be
employed to biotechnologically produce (3S,3'S) astaxanthin in systems which are
either easy to grow and can be used directly as an additive to fish food, or systems
permitting a simple and low cost extraction procedure of astaxanthin.

In one embodiment, the present invention relates to a DNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to
Haematococcus pluvialis crtO gene and allelic and species variations and
functional naturally occurring and/or man-induced variants thereof. The phrase
'allelic and species variations and functional naturally occurring and/or
man-induced variants' as used herein and in the claims below refers to the source of the
DNA (or RNA as described below) or means known in the art for obtaining it.
However, the terms 'variation' and 'variants' indicate the presence of sequence
dissimilarities (i.e., variations). It is the intention herein and in the claims below
that the sequence variations will be 77-80%, preferably 80-85%, more preferably
85-90%, most preferably 90-100% of identical nucleotides. In a preferred
embodiment the DNA segment comprises the sequence set forth in SEQ ID NO:1.
In another preferred embodiment, the DNA segment encodes the amino acid
sequence set forth in SEQ ID NO:4.

The invention also includes a pure DNA segment characterized as including
a sequence which hybridizes under high stringency conditions [e.g., as described in
Sambrook et al., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor
Laboratory, Cold Spring Harbor, N.Y. 1989] to a nucleic acid probe which
includes at least fifteen, preferably at least fifty, more preferably at least one hundred,
even more preferably at least two hundred, even more preferably at least five
hundred successive nucleotides of SEQ ID NO:1 or SEQ ID NO:2. Alternatively,
the DNA segment of the invention may be characterized as being capable of
hybridizing under low-stringency conditions to a nucleic acid probe which includes
the coding sequence (nucleotides 166 through 1152) of SEQ ID NO:1 or SEQ ID
NO:2. An example of such low-stringency conditions is as described in Sambrook
et al., using a lower hybridization temperature, such as, for example, 20°C below
the temperature employed for high-stringency hybridization conditions, as
described above.

The DNA segment of the invention may also be characterized as being
capable of hybridizing under high-stringency conditions to a nucleic acid probe
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID
NO:1 or SEQ ID NO:2.
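
The coordinate convention used above (nucleotides 166 through 1152, counted
from 1 and inclusive of both ends) can be mapped onto zero-based slicing as in
the following minimal Python sketch. The function name and the placeholder
sequence are illustrative assumptions and are not part of the disclosure; in
particular, the placeholder string is not SEQ ID NO:1.

```python
def subsequence_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # 1-based inclusive [start, end] equals 0-based half-open [start-1, end)
    return seq[start - 1:end]


# Hypothetical stand-in sequence (1600 nt); NOT the disclosed SEQ ID NO:1.
placeholder = "ACGT" * 400

coding = subsequence_1based(placeholder, 166, 1152)
print(len(coding))       # number of nucleotides in the span
print(len(coding) // 3)  # number of complete triplets in the span
```

Under this convention the span 166 through 1152 covers 987 nucleotides, that is,
329 complete triplets; the sketch does not check whether the final triplet is a
stop codon.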

The invention also includes a synthetically produced oligonucleotide (e.g.,
oligodeoxyribonucleotide or oligoribonucleotide and analogs thereof) capable of
hybridizing with at least ten-nucleotide segments of SEQ ID NO:1 or SEQ ID
NO:2.

In another embodiment, the present invention relates to an RNA segment
coding for a polypeptide comprising an amino acid sequence corresponding to
Haematococcus pluvialis crtO gene and allelic and species variations and
functional naturally occurring and/or man-induced variants thereof. In a preferred
embodiment the RNA segment comprises the sequence set forth in SEQ ID NO:2.
In another preferred embodiment, the RNA segment encodes the amino acid
sequence set forth in SEQ ID NO:4.

The invention also includes a pure RNA characterized as including a
sequence which hybridizes under high-stringency conditions to a nucleic acid probe
which includes at least fifteen, preferably at least fifty, more preferably at
least one hundred, even more preferably at least two hundred, even more preferably at
least five hundred successive nucleotides of SEQ ID NO:1 or SEQ ID NO:2.
Alternatively, the RNA of the invention may be characterized as being capable of
hybridizing under low-stringency conditions to a nucleic acid probe which includes
the coding sequence (nucleotides 166 through 1152) of SEQ ID NO:1 or SEQ ID
NO:2. Additionally, the RNA of the invention may be characterized as being
capable of hybridizing under high-stringency conditions to a nucleic acid probe
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID
NO:1 or SEQ ID NO:2.

In another embodiment, the present invention relates to a polypeptide
comprising an amino acid sequence corresponding to a Haematococcus pluvialis
crtO gene and allelic, species variations and functional naturally occurring and/or
man-induced variants thereof. In a preferred embodiment, the polypeptide
comprises the amino acid sequence set forth in SEQ ID NO:4.

It should be noted that the invention includes any peptide which is
homologous (i.e., 80-85%, preferably 85-90%, more preferably 90-100% of
identical amino acids) to the above described polypeptide. The term 'homologous'
as used herein and in the claims below, refers to the sequence identity between two
peptides. When a position in both of the two compared sequences is occupied by
identical amino acid monomeric subunits, it is homologous at that position. The
homology between two sequences is a function of the number of homologous
positions shared by the two sequences. For example, if eight of ten of the positions
in two sequences are occupied by identical amino acids then the two sequences are
80% homologous.
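
The position-by-position definition of homology given above can be illustrated
with a minimal Python sketch. It assumes two sequences that are already aligned,
of equal length and without gaps; the function name and the two ten-residue
strings are hypothetical and are not taken from SEQ ID NO:4. It does not attempt
to reproduce the alignment procedure of the GCG software mentioned earlier, whose
reported values depend on how the sequences are aligned.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions occupied by identical residues."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    if not a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences carry the same residue.
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


# Hypothetical ten-residue peptides differing at the last two positions.
print(percent_identity("MKTAYIAKQR", "MKTAYIAKAA"))  # eight of ten identical
```

With eight identical positions out of ten, the function returns 80.0, matching
the worked figure in the preceding paragraph.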

Other polypeptides which are also included in the present invention are
allelic variations, other species homologs, natural mutants, induced mutants and
peptides encoded by DNA that hybridizes under high or low stringency conditions
(see above) to the coding region (nucleotides 166 through 1152) of SEQ ID NO:1
or SEQ ID NO:2.

In another embodiment, the present invention relates to a recombinant DNA
molecule comprising a vector (for example plasmid or viral vector) and a DNA
segment coding for a polypeptide, as described above. In a preferred embodiment,
the DNA segment is present in the vector operably linked to a promoter.

In a further embodiment, the present invention relates to a host cell
containing the above described recombinant DNA molecule or DNA segment.
Suitable host cells include prokaryotes (such as bacteria, including Escherichia
coli) and both lower eukaryotes (for example yeast) and higher eukaryotes (for
example, algae, plant or animal cells). Introduction of the recombinant molecule
into the cell can be effected using methods known in the art such as, but not
limited to, transfection, transformation, micro-injection, gene bombardment etc.
The cell thus made to contain the above described recombinant DNA molecules
may be grown to form colonies or may be made to differentiate to form a
differentiated organism. The recombinant DNA molecule may be transiently
contained (e.g., by a process known in the art as transient transfection) in the cell;
nevertheless, it is preferred that the recombinant DNA molecule is stably contained
(e.g., by a process known in the art as stable transfection) in the cell. Yet in a
preferred embodiment the cell is endogenously producing, or is made by genetic
engineering means to produce, β-carotene, and the cell contains endogenous or
genetically engineered β-carotene hydroxylase activity. Such a cell may be used
as a food additive for animal (e.g., salmon) and human consumption. Furthermore,




such a cell may be used for extracting astaxanthin and/or other xanthophylls, as
described hereinbelow.

In a further embodiment, the present invention relates to a host transgenic
organism (e.g., a higher plant or animal) containing the above described
recombinant DNA molecule or the above described DNA segment in its cells.
Introduction of the recombinant molecule or the DNA segment into the host
transgenic organism can be effected using methods known in the art. Yet, in a
preferred embodiment the host organism is endogenously producing, or is made by
genetic engineering means to produce, β-carotene and, also preferably, the host
organism contains endogenous or genetically engineered β-carotene hydroxylase
activity. Such an organism may be used as a food additive for animal (e.g.,
salmon) and human consumption. Furthermore, such an organism may be used for
extracting astaxanthin and/or other xanthophylls, as described hereinbelow.

In another embodiment, the present invention relates to a method of
producing astaxanthin using the above described host cell or transgenic organism.
In yet another embodiment, the present invention relates to a method of producing
xanthophylls such as canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin,
hydroxyechinenone, zeaxanthin, adonirubin, 3-hydroxyechinenone,
3'-hydroxyechinenone and/or adonixanthin using the above described host cell or
transgenic organism. For these purposes provided is a cell or a transgenic
organism as described above. The host cell or organism is made to grow under
conditions favorable for producing astaxanthin and the above listed additional
xanthophylls, which are then extracted by methods known in the art.

In yet another embodiment, the present invention relates to a transgenic
plant expressing a transgene coding for a polypeptide including an amino acid
sequence corresponding to Haematococcus pluvialis crtO gene, allelic and species
variants or functional naturally occurring or man-induced variants thereof.
Preferably the expression is highest in chromoplast-containing tissues.

In yet another embodiment, the present invention relates to a recombinant
DNA vector which includes a first DNA segment encoding a polypeptide for
directing a protein into plant chloroplasts or chromoplasts (e.g., derived from the
Pds gene of tomato) and an in frame second DNA segment encoding a polypeptide
including an amino acid sequence corresponding to Haematococcus pluvialis crtO
gene, allelic and species variants or functional naturally occurring and
man-induced variants thereof.

In yet another embodiment, the present invention relates to a recombinant
DNA vector which includes a first DNA segment including a promoter highly
expressible in plant chloroplast- or chromoplast-containing tissues (e.g., derived
from the Pds gene of tomato) and a second DNA segment encoding a polypeptide
including an amino acid sequence corresponding to Haematococcus pluvialis crtO
gene, allelic and species variants or functional naturally occurring and
man-induced variants thereof.

Reference is now made to the following examples, which together with the
above descriptions, illustrate the invention.

EXAMPLES 


The following protocols and experimental details are referenced in the 
Examples that follow: 

Algae and growth conditions. Haematococcus pluvialis (strain 34/7 from
the Culture Collection of Algae and Protozoa, Windermere, UK) was kindly
provided by Dr. Andrew Young from the Liverpool John Moores University.
Suspension cultures of the alga were grown in a liquid medium as described by
Nichols and Bold [see, Nichols HW, Bold HC (1964) Trichosarcina polymorpha
gen. et sp. nov. J Phycol 1: 34-39]. For induction of astaxanthin biosynthesis, cells
were harvested, washed in water and resuspended in a nitrogen-depleted medium.
The cultures were maintained in 250 ml Erlenmeyer flasks under continuous light
(photon flux of 75 µE/m²/s), at IS^'C, on a rotary shaker at 80 rpm.

Construction of cDNA library. The construction of a cDNA library from
Haematococcus pluvialis was described in detail by Lotan and Hirschberg (1995)
FEBS letters 364: 125-128. Briefly, total RNA was extracted from algal cells
grown for 5 days under nitrogen-depleted conditions (cell color brown-red). Cells
from a 50 ml culture were harvested and their RNA content was extracted using
Tri reagent (Molecular Research Center, Inc.). Poly-An RNA was isolated by
two cycles of fractionation on oligo dT-cellulose (Boehringer). The final yield was
1.5% of the total RNA. The cDNA library was constructed in a Uni-ZAP™ XR
vector, using a ZAP-cDNA synthesis kit (both from Stratagene). Escherichia coli
cells of strain XL1-Blue MRF' (Stratagene) were used for amplification of the
cDNA library.

Plasmids and Escherichia coli strains. Plasmid pPL376, which contains
the genes necessary for carotenoid biosynthesis in the bacterium Erwinia herbicola,
was obtained from Tuveson [for further details regarding plasmid pPL376 see,
Tuveson RW, Larson RA & Kagan J (1988) Role of cloned carotenoid genes
expressed in Escherichia coli in protecting against inactivation by near-UV light
and specific phototoxic molecules. J Bacteriol 170: 4675-4680]. Cells of
Escherichia coli strain JM109 that carry the plasmid pPL376 accumulate the bright
yellow carotenoid, zeaxanthin glycoside. In a first step, a 1.1 kb SalI-SalI
fragment was deleted from this plasmid to inactivate the gene crtX, coding for
zeaxanthin glucosyl transferase. In a second step, partial BamHI cleavage of the
plasmid DNA, followed by self ligation, deleted a 0.8 kb fragment which
inactivated crtZ, encoding β-carotene hydroxylase. A partial BglII cleavage
generated a fragment of 7.4 kb which was cloned in the BamHI site of the plasmid
vector pACYC184. As shown in Figure 4, the resulting recombinant plasmid,
which carried the genes crtE, crtB, crtI and crtY, was designated pBCAR [Lotan
and Hirschberg (1995) FEBS letters 364: 125-128].

Plasmid pBCAR was transfected into SOLR strain cells of Escherichia coli
(Stratagene). Colonies that appeared on chloramphenicol-containing Luria Broth
(LB) medium [described in Sambrook et al., Molecular Cloning: A Laboratory
Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 1989] carried
this plasmid and developed a deep yellow-orange color due to the accumulation of
β-carotene.

As shown in Figure 5, an additional plasmid, designated pZEAX, which
allows for zeaxanthin synthesis and accumulation in Escherichia coli was
constructed [this plasmid is described in detail in Lotan and Hirschberg (1995)
FEBS letters 364: 125-128]. SOLR strain Escherichia coli cells were used as a
host for the pZEAX plasmid. Escherichia coli cells were grown on LB medium
(see above), at 37°C in the dark on a rotary shaker at 225 rpm. Ampicillin
(50 µg/ml) and/or chloramphenicol (30 µg/ml) (both from Sigma) were added to the
medium for selection of appropriate transformed cells.
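
As a worked illustration of preparing selective medium at the final
concentrations stated above, the following Python sketch applies the usual
dilution relation C1 x V1 = C2 x V2. The stock concentrations and the 100 ml
batch size are assumptions chosen for the example; they are not given in the
text, and the small volume change caused by adding the stock is ignored.

```python
def stock_volume_ul(final_ug_per_ml: float, medium_ml: float,
                    stock_mg_per_ml: float) -> float:
    """Volume of stock solution (in microliters) needed for the batch."""
    if stock_mg_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    total_ug = final_ug_per_ml * medium_ml  # mass of antibiotic required
    stock_ug_per_ul = stock_mg_per_ml       # 1 mg/ml equals 1 ug/ul
    return total_ug / stock_ug_per_ul


# Assumed stocks: ampicillin 50 mg/ml, chloramphenicol 30 mg/ml (hypothetical).
print(stock_volume_ul(50, 100, 50))  # ampicillin for 100 ml at 50 ug/ml
print(stock_volume_ul(30, 100, 30))  # chloramphenicol for 100 ml at 30 ug/ml
```

With these assumed stocks, each antibiotic requires 100 microliters of stock per
100 ml of medium.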

As shown in Figure 6, a plasmid, pHPK, containing the full length cDNA of
the β-carotene C-4-oxygenase enzyme was identified by color complementation as
described by Lotan and Hirschberg (1995) FEBS letters 364: 125-128 (see
description herein below). A 1.2 kb PstI-PstI DNA fragment, containing the
cDNA of the β-C-4-oxygenase from Haematococcus pluvialis, was isolated from
plasmid pHPK and inserted into a PstI site in the coding sequence of the crtZ gene
in the plasmid pZEAX. This recombinant plasmid was designated pCANTHA and
is shown in Figure 7.

The same 1.2 kb PstI-PstI fragment was also inserted into a PstI site which
exists 600 bp downstream of the crtE gene in the plasmid pZEAX. The resulting
recombinant plasmid was designated pASTA and is shown in Figure 8.

The same 1.2 kb PstI-PstI fragment was also inserted into a PstI site which
exists in the β-lactamase gene in the plasmid pPAN35D5 [Hirschberg J, Ohad N,
Pecker I and Rahat A (1987) Isolation and characterization of herbicide resistant
mutants in the cyanobacterium Synechococcus R2. Z. Naturforsch 42c: 102-112],
which carries the psbAI gene from the cyanobacterium Synechococcus PCC7942 in
the plasmid vector pBR328 [Hirschberg J, Ohad N, Pecker I and Rahat A (1987)
Isolation and characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112]. This plasmid was designated
PAN3.5-KETO and is shown in Figure 9. This plasmid was used in the
transformation of Synechococcus PCC7942 cells following procedures described
by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical and
gene-transfer-based methods. Methods Enzymol 167: 714-727].

Excision of phage library and screening for a β-carotene oxygenase
gene. Mass excision of the cDNA library, which was prepared as described
hereinabove, was carried out using the ExAssist helper phage (Stratagene) in cells
of SOLR strain of Escherichia coli that carried the plasmid pBCAR. The excised
library in phagemid form was transfected into Escherichia coli cells strain
XL1-Blue and the cells were plated on LB plates containing 1 mM
isopropylthio-β-D-galactoside (IPTG), 50 µg/ml ampicillin and 30 µg/ml chloramphenicol, at a
density that yielded approximately 100-150 colonies per plate. The plates were
incubated at 37°C overnight and further incubated for two more days at room
temperature. The plates were then kept at 4°C until screened for changes in colony
colors.

A plasmid for high expression of crtO in chromoplasts. As shown in
Figures 10-11, a genomic DNA sequence of a tomato species Lycopersicon
esculentum (nucleotides 1 to 1448 of the Pds gene [as published in Mann V,
Pecker I and Hirschberg J (1994) Cloning and characterization of the gene for
phytoene desaturase (Pds) from tomato (Lycopersicon esculentum). Plant
Molecular Biology 24: 429-434]), which contains the promoter of the Pds gene
and the coding sequence for the amino terminus region of the polypeptide PDS
that serves as a transit peptide for import into chloroplasts and chromoplasts,
was cloned into a HindIII-SmaI site of the binary plasmid vector pBIB
[described by Becker D (1990) Binary vectors which allow the exchange of
plant selectable markers and reporter genes. Nucleic Acids Research 18:230],
shown in Figure 10. The recombinant plasmid was designated pPTBIB and is
shown in Figure 11.

As shown in Figure 12, a 1,110 nucleotide long Eco47III-NcoI fragment,
containing the cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of
SEQ ID NO:1) was sub-cloned into the SmaI site of the plasmid pPTBIB
(Figure 11) so that the coding nucleotide sequence of the amino terminus of
Pds is in the same reading frame as crtO. The recombinant plasmid was
designated pPTCRTOBIB.
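
The requirement above that the transit peptide coding sequence and crtO share a
reading frame amounts to a multiple-of-three check on the number of coding
nucleotides lying between the first base of the upstream start codon and the
first base of the first downstream codon. The following Python sketch shows that
check only; the function names and the example lengths are hypothetical and are
not derived from the Pds or crtO sequences.

```python
def is_in_frame(coding_nt_before_junction: int) -> bool:
    """True if the downstream codons continue the upstream reading frame."""
    if coding_nt_before_junction < 0:
        raise ValueError("length must be non-negative")
    return coding_nt_before_junction % 3 == 0


def bases_to_restore_frame(coding_nt_before_junction: int) -> int:
    """Number of bases (0, 1 or 2) that would have to be added at the
    junction to bring the downstream segment back into frame."""
    return (-coding_nt_before_junction) % 3


# Hypothetical upstream coding lengths, for illustration only.
for upstream in (300, 301, 302):
    print(upstream, is_in_frame(upstream), bases_to_restore_frame(upstream))
```

A fusion with 300 coding nucleotides upstream of the junction is in frame, while
301 or 302 would shift the frame of the downstream segment unless two or one
bases, respectively, were added.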

Formation of transgenic higher plant. The DNA of pPTCRTOBIB
was extracted from E. coli cells and was transferred into cells of
Agrobacterium tumefaciens strain EHA105 [described by Hood EE, Gelvin SB,
Melchers LS and Hoekema A (1993) Transgenic Research 2:208-218] using
electroporation as described for E. coli [Dower JW, Miller FJ and Ragsdale
WC (1988) High efficiency transformation of E. coli by high voltage
electroporation. Nuc. Acids Res. 18: 6127-6145]. Agrobacterium cells were
grown at 28°C in LB medium supplemented with 50 µg/ml streptomycin and
50 µg/ml kanamycin as selective agents. Cells of Agrobacterium carrying
pPTCRTOBIB were harvested from a suspension culture at the stationary phase
of growth and used for transformation as described by Horsch RB, Fry JE,
Hoffmann NL, Eicholtz D, Rogers SG and Fraley RT, A simple and general
method for transferring genes into plants. Science (1985) 227:1229-1231; and
Jefferson AR, Kavanagh TA and Bevan WM (1987) GUS fusions:
β-glucuronidase as a sensitive and versatile gene fusion marker in higher plants.
The EMBO J. 6: 3901-3907.

Leaf explants of Nicotiana tabacum strain NN were infected with the
transformed Agrobacterium cells and kanamycin-resistant transgenic plants
were regenerated according to protocols described by Horsch et al. (1985) and
Jefferson et al. (1987) cited above.

With reference now to Figure 13, the presence of the DNA sequence of the
crtO gene-construct in the fully developed regenerated plants was determined by
DNA Southern blot analysis. To this end DNA was extracted from the leaves
[according to a protocol described by Kanazawa and Tsutsumi (1992) Extraction
of restrictable DNA from plants of the genus Nelumbo. Plant Molecular Biology
Reports 10: 316-318] and digested with the endonuclease HindIII; the fragments were
size separated by gel electrophoresis and hybridized with radioactively labeled
crtO sequence (SEQ ID NO:1).
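
The digestion step can be illustrated in silico. The following Python sketch
cuts a linear sequence at every HindIII recognition site (AAGCTT, cleaved after
the first A) and returns the fragment lengths in order. The function name and
the toy sequence are hypothetical; the sketch does not use the T-DNA sequence of
pPTCRTOBIB and does not model partial digestion.

```python
def digest_linear(seq: str, site: str = "AAGCTT", cut_offset: int = 1) -> list[int]:
    """Fragment lengths after cutting a linear sequence at each site.

    cut_offset is the number of bases of the recognition site that stay on
    the upstream fragment (1 for HindIII, which cuts A^AGCTT).
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    # Fragment boundaries: start of sequence, every cut, end of sequence.
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Toy 47-nt sequence with two HindIII sites (illustrative only).
toy = "G" * 10 + "AAGCTT" + "C" * 20 + "AAGCTT" + "T" * 5
print(digest_linear(toy))
```

For the toy sequence the middle value is the internal fragment bounded by two
HindIII sites, which is the kind of fragment whose size is independent of where
a construct has inserted into a genome.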

It was determined that each transgenic plant that was examined contained at
least one copy of the crtO DNA sequence, yielding a 1.75 kb band (arrow),
originating from an internal HindIII-HindIII fragment of the T-DNA of
pPTCRTOBIB, additional bands originating from partial digestion, additional
band/s whose sizes vary, depending on the position of insertion in the plant
genome and a 1.0 kb band originating from the tobacco plant itself which therefore
also appears in the negative control WT lane.

Sequence analysis. DNA sequence analysis was carried out by the dideoxy
method [see, Sanger F, Nicklen S & Coulson AR (1977) DNA sequencing with
chain termination inhibitors. Proc Natl Acad Sci USA 74: 5463-5467].

Carotenoids analysis. Aliquots of Escherichia coli cells which were
grown in liquid LB medium were centrifuged at 13,000 g for 10 minutes,
washed once in water and re-centrifuged. After removing the water the cells were
resuspended in 70 µl of acetone and incubated at 65°C for 15 minutes. The
samples were centrifuged again at 13,000 g for 10 minutes and the
carotenoid-containing supernatant was placed in a clean tube. The carotenoid extract was
blown to dryness under a stream of nitrogen (N2) gas and stored at -20°C until
required for analysis. Carotenoids from plant tissues were extracted by mixing
0.5-1.0 g of tissue with 100 µl of acetone, followed by incubation at 65°C for 15
minutes and then treating the samples as described above.

High-performance liquid chromatography (HPLC) of the carotenoid
extracts was carried out using an acidified reverse-phase C18 column, Spherisorb
ODS-2 (silica 5 µm, 4.6 mm x 250 mm) (Phenomenex®). The mobile phase was
pumped by triphasic Merck-Hitachi L-6200A high pressure pumps at a flow rate of
1.5 ml/min. The mobile phase consisted of an isocratic solvent system comprised
of hexane/dichloromethane/isopropyl alcohol/triethylamine (88.5:10:1.5:0.1, v/v).
Peaks were detected at 470 nm using a Waters 996 photodiode-array detector.
Individual carotenoids were identified by their retention times and their typical
absorption spectra, as compared to standard samples of chemically pure
β-carotene, zeaxanthin, echinenone, canthaxanthin, adonirubin and astaxanthin
(the latter four were kindly provided by Dr. Andrew Young from Liverpool John
Moores University).
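
As a worked illustration of the solvent ratio stated above, the following Python
sketch converts the volume parts 88.5:10:1.5:0.1 into component volumes for a
chosen batch size. It assumes that the four figures are simple volume parts
(which sum to 100.1) and that volumes are additive on mixing; the function name
and the batch size are illustrative assumptions.

```python
MOBILE_PHASE_PARTS = {
    "hexane": 88.5,
    "dichloromethane": 10.0,
    "isopropyl alcohol": 1.5,
    "triethylamine": 0.1,
}


def component_volumes(total_ml: float, parts: dict[str, float]) -> dict[str, float]:
    """Split a total volume among components in proportion to their parts."""
    total_parts = sum(parts.values())
    if total_parts <= 0:
        raise ValueError("parts must sum to a positive number")
    return {name: total_ml * p / total_parts for name, p in parts.items()}


# A 1001 ml batch makes each part correspond to roughly 10 ml.
for name, ml in component_volumes(1001.0, MOBILE_PHASE_PARTS).items():
    print(f"{name}: {ml:.1f} ml")
```

For a 1001 ml batch this works out to approximately 885 ml hexane, 100 ml
dichloromethane, 15 ml isopropyl alcohol and 1 ml triethylamine.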

Thin layer chromatography (TLC) was carried out using silica gel 60 F254
plates (Merck), using ethyl acetate/benzene (7:3, v/v) as an eluent. Visible
absorption spectra were recorded with a Shimadzu UV-160A spectrophotometer.
All spectra were recorded in acetone. Spectral fine structure was expressed in
terms of %III/II [Britton, G. (1995). UV/Visible Spectroscopy. In: Carotenoids,
Vol 1B, Spectroscopy. Eds. Britton G, Liaaen-Jensen S and Pfander H.
Birkhäuser Verlag, Basel, pp. 13-62].
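
The %III/II measure is not defined in the text itself. The following Python
sketch assumes the commonly used convention, in which the height of the
longest-wavelength band (III) is expressed as a percentage of the height of the
middle band (II), with both heights measured from the minimum between the two
peaks. The function name and the absorbance readings are hypothetical.

```python
def percent_iii_ii(a_ii: float, a_iii: float, a_trough: float) -> float:
    """Spectral fine structure as 100 * (III height) / (II height).

    Heights are measured from the trough between bands II and III.
    A spectrum without a resolved band III is reported as 0.
    """
    height_ii = a_ii - a_trough
    height_iii = a_iii - a_trough
    if height_ii <= 0:
        raise ValueError("band II must rise above the trough")
    return max(0.0, 100.0 * height_iii / height_ii)


# Hypothetical absorbance readings (arbitrary units).
print(round(percent_iii_ii(a_ii=1.0, a_iii=0.854, a_trough=0.8), 1))
```

With these made-up readings the band heights are 0.2 and 0.054, giving a value
of about 27, while a rounded single-peak spectrum with no resolved band III
gives 0.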

Isolation and identification of the carotenoids extracted from cells of E. coli
are treated in order of increasing adsorption (decreasing Rf values) on silica TLC
plates. Carotenoid structures and the biosynthesis pathway of astaxanthin are
given in Figure 14. The following details refer to the carotenoids numbered 1
through 9 in Figure 14.

β-Carotene (1). Rf 0.92, inseparable from authentic (1). Rt. VIS λmax nm:
(428), 452, 457, %III/II = 0.

Echinenone (2). Rf 0.90, inseparable from authentic (2). Rt. VIS λmax nm:
455, %III/II = 0.

Canthaxanthin (3). Rf 0.87, inseparable from authentic (3). Rt. VIS λmax
nm: 470, %III/II = 0.

β-Cryptoxanthin (4). Rf 0.83. VIS λmax nm: (428), 451, 479, %III/II = 0.

Adonirubin (5). Rf 0.82, inseparable from authentic (5). Rt. VIS λmax nm:
476, %III/II = 0.

Astaxanthin (6). Rf 0.79, inseparable from authentic (6). Rt. VIS λmax
nm: 477, %III/II = 0.

Adonixanthin (7). Rf 0.72. Rt. VIS λmax nm: 464, %III/II = 0.

Zeaxanthin (8). Rf 0.65, inseparable from authentic (8). Rt. VIS λmax nm:
(428), 451, 483, %III/II = 27.

Hydroxyechinenone (9). Rf 0.80, Rt 3.0. VIS λmax nm: 464, %III/II = 0.
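
The Rf values listed above can be collected into a small lookup table. The
following Python sketch returns the listed carotenoid whose Rf lies closest to
an observed value, within a tolerance. The function name and the tolerance are
illustrative assumptions; because several of the listed values lie within 0.01
to 0.03 of one another, such a lookup can only suggest a candidate, which is
consistent with the use of absorption spectra and authentic standards in the
text.

```python
# Rf values on silica TLC as listed in the text (carotenoids 1 through 9).
RF_STANDARDS = {
    "beta-carotene": 0.92,
    "echinenone": 0.90,
    "canthaxanthin": 0.87,
    "beta-cryptoxanthin": 0.83,
    "adonirubin": 0.82,
    "hydroxyechinenone": 0.80,
    "astaxanthin": 0.79,
    "adonixanthin": 0.72,
    "zeaxanthin": 0.65,
}


def closest_by_rf(observed: float, tolerance: float = 0.015):
    """Name of the listed carotenoid nearest to the observed Rf, or None
    if no listed value lies within the tolerance."""
    name, rf = min(RF_STANDARDS.items(), key=lambda kv: abs(kv[1] - observed))
    return name if abs(rf - observed) <= tolerance else None


print(closest_by_rf(0.87))  # exact match to a listed value
print(closest_by_rf(0.50))  # far from every listed value
```

An observed Rf of 0.87 maps to canthaxanthin, whereas 0.50 matches nothing in
the table and yields None.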

Chirality configuration. Chirality configuration of astaxanthin was
determined by HPLC of the derived diastereoisomeric camphanates of the
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen-Jensen S (1981)
Optical purity of (3S,3'S)-astaxanthin from Haematococcus pluvialis. Phytochem
20: 2561-2565]. The analysis proved that the Escherichia coli cells synthesize
pure (3S,3'S) astaxanthin.

EXAMPLE 1 

Cloning the β-C-4-oxygenase gene

A cDNA library was constructed in Lambda ZAP II vector from poly-An
RNA of Haematococcus pluvialis cells that had been induced to synthesize
astaxanthin by nitrogen deprivation as described hereinabove. The entire library
was excised into β-carotene-accumulating cells of Escherichia coli, strain SOLR,
which carried plasmid pBCAR (shown in Figure 4). Screening for a β-carotene
oxygenase gene was based on color visualization of colonies 3 mm in
diameter. Astaxanthin and other oxygenated forms of β-carotene (i.e.,
xanthophylls) have distinct darker colors and thus can be detected from the yellow
β-carotene background. The screening included approximately 100,000 colonies
which were grown on LB medium plates containing ampicillin and
chloramphenicol that selected for both the Lambda ZAP II vector in its plasmid
propagating form and the pBCAR plasmid. Several colonies showed different
color tones but only one exhibited a conspicuous brown-red pigment. This colony,
presumed to contain a xanthophyll biosynthesis gene, was selected for further
analysis described hereinbelow in the following Examples.

EXAMPLE 2

Analysis of the β-C-4-oxygenase activity in Escherichia coli

The red-brown colony presumed to contain a xanthophyll biosynthesis gene
(see Example 1 above) was streaked and further analyzed. First, the recombinant
ZAP II plasmid carrying the cDNA clone that was responsible for xanthophyll
synthesis in Escherichia coli was isolated by preparing plasmid DNA from the
red-brown colony, transfecting it into Escherichia coli cells of the strain
XL1-Blue and selecting on ampicillin-containing medium. This plasmid, designated
pHPK (pHPK is a Lambda ZAP II vector containing an insert isolated from the
red-brown colony), was used to transform β-carotene-producing Escherichia coli
cells (Escherichia coli SOLR strain carrying the plasmid pBCAR shown in Figure 4),
resulting in the formation of red-brown colonies. Carotenoids from this
transformant, as well as from the host cells (as control), were extracted with
acetone and analyzed by HPLC.

HPLC analysis of carotenoids of the host bacteria, which synthesized
β-carotene (Escherichia coli SOLR strain carrying the plasmid pBCAR shown in
Figure 4), as compared with a brown-red colony, revealed that only traces of
β-carotene were observed in the transformant cells, while a new major peak of
canthaxanthin and another minor peak of echinenone appeared [described in detail
by Lotan and Hirschberg (1995) FEBS Letters 364: 125-128]. These results
indicate that the cDNA in plasmid pHPK, designated crtO, encodes an enzyme with
β-C-4-oxygenase activity, which converts β-carotene to canthaxanthin via
echinenone (see Figure 14). It is therefore concluded that a single enzyme
catalyzes this two-step ketonization conversion by acting symmetrically on the 4
and 4' carbons of the β- and β'-rings of β-carotene, respectively.

EXAMPLE 3 
Production of astaxanthin in Escherichia coli cells 

To determine whether β-carotene hydroxylase (e.g., a product of the crtZ
gene of Erwinia herbicola) can convert thus produced canthaxanthin to astaxanthin
and/or whether zeaxanthin converted from β-carotene by β-carotene hydroxylase
can be converted by β-C-4-oxygenase to astaxanthin, the crtO cDNA of
Haematococcus pluvialis thus isolated was expressed in Escherichia coli cells
together with the crtZ gene of Erwinia herbicola. For this purpose, Escherichia
coli cells of strain SOLR were transfected with either plasmid pASTA alone,
containing, as shown in Figure 8, both crtZ and crtO, or, alternatively, with both
plasmids pHPK, containing, as shown in Figure 6, crtO, and pZEAX, containing, as
shown in Figure 5, crtZ. Carotenoids in the resulting transformed cells were
extracted and analyzed by HPLC as described above. The results, given in Table
1, show the composition of carotenoids extracted from the cells containing the
plasmid pASTA. A similar carotenoid composition is found in Escherichia coli cells
which carry both pHPK and pZEAX.



TABLE 1

Carotenoid           % of total carotenoid composition

β-Carotene             8.0
Echinenone             1.7
β-Cryptoxanthin        4.2
Canthaxanthin          4.2
Zeaxanthin            57.8
Adonirubin             1.0
Adonixanthin          17.9
Astaxanthin            5.2



The results presented in Table 1 prove that carotenoids possessing either a
β-end group or a 4-keto-β-end group act as substrates for the hydroxylation
reactions catalyzed by the crtZ gene product at carbons C-3 and C-3'. The
hydroxylation of β-carotene and canthaxanthin results in the production of
zeaxanthin and astaxanthin, respectively. These hydroxylations result in the
production of astaxanthin and the intermediate ketocarotenoids
3-hydroxyechinenone, adonixanthin and adonirubin. These results further
demonstrate that astaxanthin can be produced in heterologous cells by expressing
the gene crtO together with a gene that codes for a β-carotene hydroxylase.
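
The combined action of the two enzymes can be pictured as a small network in which each of the two β-rings independently gains a 4-keto group (crtO product) and a 3-hydroxy group (crtZ product). The sketch below is a simplified illustrative model only: it assumes that either enzyme accepts any ring that still lacks its group, whereas the results above establish this only for the hydroxylase acting on β- and 4-keto-β-end groups. The helper names are hypothetical.

```python
# Each beta-ring is described by (has_4_keto, has_3_hydroxy).
PLAIN, KETO, HYDROXY, BOTH = (False, False), (True, False), (False, True), (True, True)


def key(ring_a, ring_b):
    """Order-independent identifier for a carotenoid with two beta-rings."""
    return tuple(sorted((ring_a, ring_b)))


# Trivial names for every combination of ring states.
NAMES = {
    key(PLAIN, PLAIN): "beta-carotene",
    key(PLAIN, KETO): "echinenone",
    key(KETO, KETO): "canthaxanthin",
    key(PLAIN, HYDROXY): "beta-cryptoxanthin",
    key(HYDROXY, HYDROXY): "zeaxanthin",
    key(PLAIN, BOTH): "3-hydroxyechinenone",
    key(KETO, HYDROXY): "3'-hydroxyechinenone",
    key(KETO, BOTH): "adonirubin",
    key(HYDROXY, BOTH): "adonixanthin",
    key(BOTH, BOTH): "astaxanthin",
}


def single_steps(compound):
    """Yield (enzyme, product) for one modification of one ring.

    Simplifying assumption: crtO ketolates any ring lacking a 4-keto group
    and crtZ hydroxylates any ring lacking a 3-hydroxy group.
    """
    seen = set()
    for i in (0, 1):
        (keto, hydroxy), other = compound[i], compound[1 - i]
        candidates = []
        if not keto:
            candidates.append(("crtO", key((True, hydroxy), other)))
        if not hydroxy:
            candidates.append(("crtZ", key((keto, True), other)))
        for step in candidates:
            if step not in seen:  # identical rings give the same product twice
                seen.add(step)
                yield step


def reachable(start):
    """Return every compound that can be formed from `start` in this model."""
    found, frontier = {start}, [start]
    while frontier:
        current = frontier.pop()
        for _, product in single_steps(current):
            if product not in found:
                found.add(product)
                frontier.append(product)
    return found


start = key(PLAIN, PLAIN)
for enzyme, product in single_steps(start):
    print(f"beta-carotene --{enzyme}--> {NAMES[product]}")
print(sorted(NAMES[c] for c in reachable(start)))
```

In this model the ten ring-state combinations correspond to β-carotene, the intermediates named in the text and in Table 1, and astaxanthin as the only fully substituted end product.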

EXAMPLE 4 

Sequence analysis of the gene for β-carotene C-4-oxygenase

The full length cDNA insert in plasmid pHPK (1771 base pairs), as was
determined by the presence of a poly A tail, was subjected to nucleotide
sequence analysis. This sequence, set forth in SEQ ID NO:1, and its translation to
an amino acid sequence, set forth in SEQ ID NO:3 (329 amino acids), were
deposited in the EMBL database on May 1, 1995, and obtained the EMBL accession
numbers X86782 and X86783, respectively.

An open reading frame (ORF) of 825 nucleotides (nucleotides 166 through
1152 in SEQ ID NO:3) was identified in this sequence. This ORF codes for the
enzyme β-carotene C-4-oxygenase, having 329 amino acids set forth in SEQ ID
NO:4, as proven by its functional expression in Escherichia coli cells (see
Example 3 above). The gene for this enzyme was designated crtO.
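
The ORF coordinates and the peptide length quoted above can be cross-checked with simple arithmetic, and the first codons of the ORF can be translated with the standard genetic code. The sketch below is illustrative only: the function names are hypothetical, the two codon strings are copied from the sequence listing (nucleotides 166 through 192 and 1141 through 1155), and the standard code table is assumed to apply. By this count, positions 166 through 1152 span 987 nucleotides (329 codons), which matches the 329-residue peptide; the 825-nucleotide figure quoted in the text is not reproduced by these coordinates.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def transcribe(dna: str) -> str:
    """Return the RNA version of a DNA coding strand (T replaced by U)."""
    return dna.replace("T", "U")


ORF_START, ORF_END = 166, 1152            # coordinates quoted in the text
orf_length = ORF_END - ORF_START + 1      # inclusive span
print(orf_length, orf_length // 3, orf_length % 3)

first_codons = "ATGCAGCTAGCAGCGACAGTAATGTTG"   # nucleotides 166-192
last_codons = "CTGGTTCCTGCCTAG"                # nucleotides 1141-1155 (ends with stop)
print(translate(first_codons))
print(translate(last_codons))
print(transcribe(first_codons))
```

The translated fragments correspond to the first nine residues (Met Gln Leu Ala Ala Thr Val Met Leu) and the last four residues (Leu Val Pro Ala) of the peptide in the sequence listing.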

EXAMPLE 5 
Transformation of cyanobacteria with crtO

The plasmid DNA of pPAN3.5-KETO, shown in Figure 9, was transfected
into cells of the cyanobacterium Synechococcus PCC7942 according to the method
described by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical
and gene-transfer-based methods. Methods Enzymol 167: 714-727]. The
cyanobacterial cells were plated on petri dishes containing BG11 medium
supplemented with chloramphenicol. Colonies of chloramphenicol-resistant
Synechococcus PCC7942, which appeared after ten days, were analyzed for their
carotenoid content. As detailed in Table 2 below, HPLC analysis of these cells
revealed that the major carotenoid components of the cells were β-carotene,
echinenone, canthaxanthin, adonirubin and astaxanthin. A similar analysis of the
wild type strain and of Synechococcus PCC7942 transfected with a plasmid in
which the orientation of the crtO gene is reversed (not shown), and which is
therefore not capable of producing an active protein, did not reveal production of
echinenone, canthaxanthin, adonirubin and astaxanthin.

These results prove that crtO of Haematococcus pluvialis can be expressed
in cyanobacteria and that its expression provides the β-C-4-oxygenase enzymatic
activity needed for the conversion of β-carotene to canthaxanthin. This result
further demonstrates that the endogenous β-carotene hydroxylase of
Synechococcus PCC7942 is able to convert the canthaxanthin thus produced to
astaxanthin. Since the carotenoid biosynthesis pathway is similar in all green
photosynthetic organisms [see Figures 1 and 10 and Pecker I, Chamovitz D, Linden
H, Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966], it is deduced that
astaxanthin can be produced in algae and higher plants by expressing crtO in any
tissue that also expresses the endogenous β-carotene hydroxylase. It is further
deduced that astaxanthin can be produced by any organism, provided it contains
either an endogenous or an engineered β-carotene biosynthesis pathway, by
expressing crtO in any tissue that expresses either an endogenous or a genetically
engineered β-carotene hydroxylase.

TABLE 2

Carotenoid           % of total carotenoid composition

β-Carotene            31.5
Echinenone            18.5
Canthaxanthin         16.1
Zeaxanthin            22.3
Adonirubin             6.0
Astaxanthin            5.6
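
As a quick consistency check on Table 2, the percentages can be summed and the share contributed by the 4-keto carotenoids, which require the crtO product, can be totalled. This is a minimal sketch; the dictionary and variable names are illustrative, and the grouping of echinenone, canthaxanthin, adonirubin and astaxanthin as ketocarotenoids follows the pathway described in the text.

```python
import math

# Percent of total carotenoids in transformed Synechococcus PCC7942 (Table 2).
TABLE_2 = {
    "beta-carotene": 31.5,
    "echinenone": 18.5,
    "canthaxanthin": 16.1,
    "zeaxanthin": 22.3,
    "adonirubin": 6.0,
    "astaxanthin": 5.6,
}

# Pigments carrying at least one 4-keto group, i.e. products of crtO activity.
KETOCAROTENOIDS = ("echinenone", "canthaxanthin", "adonirubin", "astaxanthin")

total = sum(TABLE_2.values())
keto_share = sum(TABLE_2[name] for name in KETOCAROTENOIDS)

# The listed pigments should account for the whole carotenoid pool.
assert math.isclose(total, 100.0)

print(f"total listed: {total:.1f} %")
print(f"ketocarotenoids: {keto_share:.1f} % of total carotenoids")
```

On these figures, a little under half of the carotenoid pool of the transformed cells carries a 4-keto group.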



EXAMPLE 6 

Determining the chirality configuration of astaxanthin
produced in heterologous systems

The chirality configurations of astaxanthin produced by Escherichia coli
cells, as described under Example 3 hereinabove, and by cyanobacterium
Synechococcus PCC7942 cells, as described in Example 5 hereinabove, were
determined by HPLC of the derived diastereoisomeric camphanates of the
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen-Jensen S (1981)
Optical purity of (3S,3'S)-astaxanthin from Haematococcus pluvialis. Phytochem
20: 2561-2565]. The analysis proved that the Escherichia coli and Synechococcus
PCC7942 cells described above synthesize pure (3S,3'S) astaxanthin.
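
Astaxanthin has two stereocentres, C-3 and C-3', on two otherwise identical halves of the molecule, so the four R/S assignments collapse to three distinct stereoisomers. The short sketch below enumerates them; the function name is hypothetical, and the multiplicities describe only what a hypothetical non-selective formation of each centre would give, not a result of this work.

```python
from itertools import product


def astaxanthin_stereoisomers():
    """Enumerate distinct configurations at C-3 and C-3' with their multiplicities.

    The two ends of the molecule are identical, so (3R,3'S) and (3S,3'R)
    describe the same (meso) compound and are counted together.
    """
    counts = {}
    for config in product("RS", repeat=2):
        first, second = sorted(config)  # order of the two ends is irrelevant
        label = f"(3{first},3'{second})"
        counts[label] = counts.get(label, 0) + 1
    return counts


isomers = astaxanthin_stereoisomers()
for label, multiplicity in isomers.items():
    print(label, multiplicity)
```

Only one of these three forms, (3S,3'S), was detected in the Escherichia coli and Synechococcus PCC7942 cells described above.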

EXAMPLE 7 
Transformation of a higher plant with crtO 

Producing natural astaxanthin in higher plants has two anticipated
benefits. First, as a pure chemical, astaxanthin is widely used as a feed additive
for fish. It is a potential food colorant suitable for human consumption and
has potential applications in the cosmetic industry. Second, inducing
astaxanthin biosynthesis in vivo in flowers and fruits will provide attractive
pink/red colors, which will enhance their appearance and/or nutritional value.

In flowers and fruits, carotenoids are normally synthesized and
accumulated to high concentrations in chromoplasts, typical pigment-containing
plastids, thus providing the typical intense colors of these organs.
Inducing synthesis of astaxanthin in chromoplasts enables the accumulation of
a high concentration of this ketocarotenoid. Over-expression of carotenoid
biosynthesis genes that results in elevated concentrations of carotenoids in
chloroplasts, or other alterations in carotenoid composition in chloroplasts, may
damage the thylakoid membranes and impair photosynthesis, and is thus deleterious
to the plants. In contrast, an increase in carotenoid concentration or an
alteration in carotenoid composition in chromoplasts does not affect the viability
of the plant or the yield of fruits and flowers.

Thus, gene-transfer technology was used to implant the crtO gene
isolated from the alga Haematococcus pluvialis, as described, into a higher
plant, in such a way that its expression is up-regulated especially in
chromoplast-containing cells.

To this end, a T-DNA-containing binary plasmid vector, as shown in
Figure 12, was assembled in E. coli from the promoter and the coding DNA
sequence of the transit peptide encoded by the Pds gene from the tomato species
Lycopersicon esculentum, linked to the coding DNA sequence of crtO from H.
pluvialis. Upon stable transfer of this DNA construct via Agrobacterium-mediated
transformation into a tobacco (Nicotiana tabacum NN) plant to form a
transgenic plant, as described under methods above, the plant acquired the
ability to produce ketocarotenoids, especially in flower tissues
(chromoplast-containing cells). It should be noted that the Pds gene promoter is
capable of directing transcription, and therefore expression, especially in
chloroplast- and/or chromoplast-containing tissues of plants. It should be further
noted that the transit peptide encoded by part of the Pds coding sequence is
capable of directing conjugated (i.e., in frame) proteins into plant chromoplasts
and/or chloroplasts.

As shown in Figure 15, in chromoplast-containing cells, such as in the
nectary tissue of the tobacco flower, this DNA construct induces
accumulation of astaxanthin and other ketocarotenoids to a higher level, which
alters the color from the normal yellow to red.

Concentration and composition of carotenoids in chloroplast-containing
tissues, such as leaves, and in chromoplast-containing tissues, such as flowers,
were determined in the transgenic plants and compared to normal
non-transformed plants.

Carotenoid compositions in leaves (chloroplast-containing tissue) and
in the nectary tissue of flowers (chromoplast-containing tissue) of wild-type and
transgenic tobacco plants were determined by thin layer chromatography (TLC)
and by high pressure liquid chromatography (HPLC) as described above.

Total carotenoid concentrations in leaves (chloroplast-containing
tissue) and in the nectary tissue of flowers (chromoplast-containing tissue) of
wild-type and transgenic tobacco plants are summarized in Table 3 below.

The percentage compositions of carotenoids in leaves of wild-type and
transgenic tobacco plants are summarized in Table 4 below.

The percentage compositions of carotenoids in the nectary tissue of flowers of
wild-type and transgenic tobacco plants are summarized in Table 5 below.


TABLE 3

µg carotenoids per g fresh weight

                                 Wild-type    Transgenic with crtO

Leaf (chloroplasts)                 200              240
Nectary tissue (chromoplasts)       280              360



TABLE 4

% of total carotenoid composition in chloroplast-containing tissue (leaf)

                             Wild-type    Transgenic

β-carotene                     29.9         26.7
neoxanthin                      5.0          5.9
violaxanthin                   11.6         18.1
antheraxanthin                  4.9          2.6
lutein                         43.9         41.4
zeaxanthin                      4.7          4.3
astaxanthin + adonirubin        0.0          1.0

TABLE 5

% of total carotenoid composition in chromoplast-containing tissue (flower)

                             Wild-type    Transgenic

β-carotene                     58.1         21.0
violaxanthin                   40.3          1.5
lutein                          0.0          1.1
zeaxanthin                      1.6          1.0
hydroxyechinenone               0.0         13.7
3'-hydroxyechinenone            0.0          4.1
adonirubin                      0.0         22.4
adonixanthin                    0.0          8.7
astaxanthin                     0.0         26.5

Please note the elevated content of hydroxyechinenone,
3'-hydroxyechinenone, adonirubin, adonixanthin and astaxanthin, especially in
the chromoplast-containing tissue of the transgenic tobacco plants.
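
The percentages of Table 5 can be combined with the total concentrations of Table 3 to estimate absolute pigment amounts in the nectary tissue. The sketch below is a rough illustration under the assumption that the totals of Table 3 and the percentages of Table 5 refer to comparable samples; the names are hypothetical, and the results are in µg per g fresh weight.

```python
import math

# Total carotenoids in nectary tissue, ug per g fresh weight (Table 3).
NECTARY_TOTAL = {"wild-type": 280.0, "transgenic": 360.0}

# Percent of total carotenoids in nectary tissue (Table 5).
FLOWER_PERCENT = {
    "wild-type": {
        "beta-carotene": 58.1, "violaxanthin": 40.3, "lutein": 0.0,
        "zeaxanthin": 1.6, "hydroxyechinenone": 0.0,
        "3'-hydroxyechinenone": 0.0, "adonirubin": 0.0,
        "adonixanthin": 0.0, "astaxanthin": 0.0,
    },
    "transgenic": {
        "beta-carotene": 21.0, "violaxanthin": 1.5, "lutein": 1.1,
        "zeaxanthin": 1.0, "hydroxyechinenone": 13.7,
        "3'-hydroxyechinenone": 4.1, "adonirubin": 22.4,
        "adonixanthin": 8.7, "astaxanthin": 26.5,
    },
}

KETOCAROTENOIDS = ("hydroxyechinenone", "3'-hydroxyechinenone",
                   "adonirubin", "adonixanthin", "astaxanthin")


def absolute_content(plant: str, pigment: str) -> float:
    """Estimated amount of one pigment in nectary tissue, ug per g fresh weight."""
    return NECTARY_TOTAL[plant] * FLOWER_PERCENT[plant][pigment] / 100.0


def ketocarotenoid_content(plant: str) -> float:
    """Estimated amount of all ketocarotenoids together, ug per g fresh weight."""
    return sum(absolute_content(plant, pigment) for pigment in KETOCAROTENOIDS)


for plant in ("wild-type", "transgenic"):
    # Each column of Table 5 should account for the whole carotenoid pool.
    assert math.isclose(sum(FLOWER_PERCENT[plant].values()), 100.0)
    print(plant,
          f"astaxanthin {absolute_content(plant, 'astaxanthin'):.1f}",
          f"ketocarotenoids {ketocarotenoid_content(plant):.1f}")
```

On these assumptions the transgenic nectary tissue holds roughly 95 µg astaxanthin per g fresh weight, and about three quarters of its carotenoids are ketocarotenoids, whereas none are found in the wild type.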

Thus, the present invention successfully addresses the shortcomings of the
presently known configurations by enabling a relatively low cost biotechnological
production of (3S,3'S) astaxanthin by providing a peptide having a
β-C-4-oxygenase activity; a DNA segment coding for this peptide; an RNA segment
coding for this peptide; a recombinant DNA molecule comprising a vector and the
DNA segment; a host containing the above described recombinant DNA molecule
or DNA segment; and a method for biotechnologically producing (3S,3'S)
astaxanthin or a food additive containing (3S,3'S) astaxanthin, using the host.



While the invention has been described with respect to a limited number of
embodiments, it will be appreciated that many variations, modifications and other
applications of the invention may be made.

SEQUENCE LISTING

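(1) GENERAL INFORMATION:

(i) APPLICANT: Joseph Hirschberg, Tamar Lotan and Mark Barker

(ii) TITLE OF INVENTION: Polynucleotide molecule from
Haematococcus pluvialis encoding a
polypeptide having a β-C-4-oxygenase
activity for biotechnological production of
(3S,3'S) astaxanthin.

(iii) NUMBER OF SEQUENCES: 4

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Mark M. Friedman c/o Robert Sheinbein
(B) STREET: 2940 Birchtree space lane
(C) CITY: Silver Spring
(D) STATE: Maryland
(E) COUNTRY: United States of America
(F) ZIP: 20906

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: megabyte, 3.5" microdisk
(B) COMPUTER: Twinhead Slimnote-890TX
(C) OPERATING SYSTEM: MS DOS version 6.2, Windows version 3.11
(D) SOFTWARE: Word for Windows version 2.0

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Friedman, Mark
(B) REGISTRATION NUMBER: 33,883
(C) REFERENCE/DOCKET NUMBER: 325/5

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 972-3-5625553
(B) TELEFAX: 972-3-5625554

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1771 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: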



GGC ACG AGC TTG CAC GCA AGT CAG CGC GCG CAA GTC AAC ACC TGC CGG    48
TCC ACA GCC TCA AAT AAT AAA GAG CTC AAG CGT TTG TGC GCC TCG ACG    96
TGG CCA GTC TGC ACT GCC TTG AAC CCG CGA GTC TCC CGC CGC ACT GAC   144
TGC CAT AGC ACA GCT AGA CGA ATG CAG CTA GCA GCG ACA GTA ATG TTG   192
GAG CAG CTT ACC GGA AGC GCT GAG GCA CTC AAG GAG AAG GAG AAG GAG   240
GTT GCA GGC AGC TCT GAC GTG TTG CGT ACA TGG GCG ACC CAG TAC TCG   288
CTT CCG TCA GAA GAG TCA GAC GCG GCC CGC CCG GGA CTG AAG AAT GCC   336
TAC AAG CCA CCA CCT TCC GAC ACA AAG GGC ATC ACA ATG GCG CTA CGT   384
GTC ATC GGC TCC TGG GCC GCA GTG TTC CTC CAC GCC ATT TTT CAA ATC   432
AAG CTT CCG ACC TCC TTG GAC CAG CTG CAC TGG CTG CCC GTG TCA GAT   480
GCC ACA GCT CAG CTG GTT AGC GGC ACG AGC AGC CTG CTC GAC ATC GTC   528
GTA GTA TTC TTT GTC CTG GAG TTC CTG TAC ACA GGC CTT TTT ATC ACC   576
ACG CAT GAT GCT ATG CAT GGC ACC ATC GCC ATG AGA AAC AGG CAG CTT   624
AAT GAC TTC TTG GGC AGA GTA TGC ATC TCC TTG TAC GCC TGG TTT GAT   672
TAC AAC ATG CTG CAC CGC AAG CAT TGG GAG CAC CAC AAC CAC ACT GGC   720
GAG GTG GGC AAG GAC CCT GAC TTC CAC AGG GGA AAC CCT GGC ATT GTG   768
CCC TGG TTT GCC AGC TTC ATG TCC AGC TAC ATG TCG ATG TGG CAG TTT   816
GCG CGC CTC GCA TGG TGG ACG GTG GTC ATG CAG CTG CTG GGT GCG CCA   864
ATG GCG AAC CTG CTG GTG TTC ATG GCG GCC GCG CCC ATC CTG TCC GCC   912
TTC CGC TTG TTC TAC TTT GGC ACG TAC ATG CCC CAC AAG CCT GAG CCT   960
GGC GCC GCG TCA GGC TCT TCA CCA GCC GTC ATG AAC TGG TGG AAG TCG  1008
CGC ACT AGC CAG GCG TCC GAC CTG GTC AGC TTT CTG ACC TGC TAC CAC  1056
TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG  1104
GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC  1152
TAG CTG GAC ACA CTG CAG TGG GCC CTG CTG CCA GCT GGG CAT GCA GGT  1200
TGT GGC AGG ACT GGG TGA GGT GAA AAG CTG CAG GCG CTG CTG CCG GAC  1248
ACG CTG CAT GGG CTA CCC TGT GTA GCT GCC GCC ACT AGG GGA GGG GGT  1296
TTG TAG CTG TCG AGC TTG CCC CAT GGA TGA AGC TGT GTA GTG GTG CAG  1344
GGA GTA CAC CCA CAG GCC AAC ACC CTT GCA GGA GAT GTC TTG CGT CGG  1392
GAG GAG TGT TGG GCA GTG TAG ATG CTA TGA TTG TAT CTT AAT GCT GAA  1440
GCC TTT AGG GGA GCG ACA CTT AGT GCT GGG CAG GCA ACG CCC TGC AAG  1488
GTG CAG GCA CAA GCT AGG CTG GAC GAG GAC TCG GTG GCA GGC AGG TGA  1536
AGA GGT GCG GGA GGG TGG TGC CAC ACC CAC TGG GCA AGA CCA TGC TGC  1584
AAT GCT GGC GGT GTG GCA GTG AGA GCT GCG TGA TTA ACT GGG CTA TGG  1632
ATT GTT TGA GCA GTC TCA CTT ATT CTT TGA TAT AGA TAC TGG TCA GGC  1680
AGG TCA GGA GAG TGA GTA TGA ACA AGT TGA GAG GTG GTG CGC TGC CCC  1728
TGC GCT TAT GAA GCT GTA ACA ATA AAG TGG TTC [remainder illegible]  1771

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1771 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GGC ACG AGC UUG CAC GCA AGU CAG CGC GCG CAA GUC AAC ACC UGC CGG    48
UCC ACA GCC UCA AAU AAU AAA GAG CUC AAG CGU UUG UGC GCC UCG ACG    96
UGG CCA GUC UGC ACU GCC UUG AAC CCG CGA GUC UCC CGC CGC ACU GAC   144
UGC CAU AGC ACA GCU AGA CGA AUG CAG CUA GCA GCG ACA GUA AUG UUG   192
GAG CAG CUU ACC GGA AGC GCU GAG GCA CUC AAG GAG AAG GAG AAG GAG   240
GUU GCA GGC AGC UCU GAC GUG UUG CGU ACA UGG GCG ACC CAG UAC UCG   288
CUU CCG UCA GAA GAG UCA GAC GCG GCC CGC CCG GGA CUG AAG AAU GCC   336
UAC AAG CCA CCA CCU UCC GAC ACA AAG GGC AUC ACA AUG GCG CUA CGU   384
GUC AUC GGC UCC UGG GCC GCA GUG UUC CUC CAC GCC AUU UUU CAA AUC   432
AAG CUU CCG ACC UCC UUG GAC CAG CUG CAC UGG CUG CCC GUG UCA GAU   480
GCC ACA GCU CAG CUG GUU AGC GGC ACG AGC AGC CUG CUC GAC AUC GUC   528
GUA GUA UUC UUU GUC CUG GAG UUC CUG UAC ACA GGC CUU UUU AUC ACC   576
ACG CAU GAU GCU AUG CAU GGC ACC AUC GCC AUG AGA AAC AGG CAG CUU   624
AAU GAC UUC UUG GGC AGA GUA UGC AUC UCC UUG UAC GCC UGG UUU GAU   672
UAC AAC AUG CUG CAC CGC AAG CAU UGG GAG CAC CAC AAC CAC ACU GGC   720
GAG GUG GGC AAG GAC CCU GAC UUC CAC AGG GGA AAC CCU GGC AUU GUG   768
CCC UGG UUU GCC AGC UUC AUG UCC AGC UAC AUG UCG AUG UGG CAG UUU   816
GCG CGC CUC GCA UGG UGG ACG GUG GUC AUG CAG CUG CUG GGU GCG CCA   864
AUG GCG AAC CUG CUG GUG UUC AUG GCG GCC GCG CCC AUC CUG UCC GCC   912
UUC CGC UUG UUC UAC UUU GGC ACG UAC AUG CCC CAC AAG CCU GAG CCU   960
GGC GCC GCG UCA GGC UCU UCA CCA GCC GUC AUG AAC UGG UGG AAG UCG  1008
CGC ACU AGC CAG GCG UCC GAC CUG GUC AGC UUU CUG ACC UGC UAC CAC  1056
UUC GAC CUG CAC UGG GAG CAC CAC CGC UGG CCC UUC GCC CCC UGG UGG  1104
GAG CUG CCC AAC UGC CGC CGC CUG UCU GGC CGA GGU CUG GUU CCU GCC  1152
UAG CUG GAC ACA CUG CAG UGG GCC CUG CUG CCA GCU GGG CAU GCA GGU  1200
UGU GGC AGG ACU GGG UGA GGU GAA AAG CUG CAG GCG CUG CUG CCG GAC  1248
ACG CUG CAU GGG CUA CCC UGU GUA GCU GCC GCC ACU AGG GGA GGG GGU  1296
UUG UAG CUG UCG AGC UUG CCC CAU GGA UGA AGC UGU GUA GUG GUG CAG  1344
GGA GUA CAC CCA CAG GCC AAC ACC CUU GCA GGA GAU GUC UUG CGU CGG  1392
GAG GAG UGU UGG GCA GUG UAG AUG CUA UGA UUG UAU CUU AAU GCU GAA  1440
GCC UUU AGG GGA GCG ACA CUU AGU GCU GGG CAG GCA ACG CCC UGC AAG  1488
GUG CAG GCA CAA GCU AGG CUG GAC GAG GAC UCG GUG GCA GGC AGG UGA  1536
AGA GGU GCG GGA GGG UGG UGC CAC ACC CAC UGG GCA AGA CCA UGC UGC  1584
AAU GCU GGC GGU GUG GCA GUG AGA GCU GCG UGA UUA ACU GGG CUA UGG  1632
AUU GUU UGA GCA GUC UCA CUU AUU CUU UGA UAU AGA UAC UGG UCA GGC  1680
AGG UCA GGA GAG UGA GUA UGA ACA AGU UGA GAG GUG GUG CGC UGC CCC  1728
UGC GCU UAU GAA GCU GUA ACA AUA AAG UGG UUC [remainder illegible]  1771

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1771 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:



GGC 


ACG 


AGC 


TTG 


CAC 


GCA 


AGT 


CAG 


CGC 


GCG 


CAA GTC 


AAC 


ACC 


TGC 


CGG 


48 


TCC 


ACA 


GCC 


TCA 


AAT 


AAT 


AAA 


GAG 


CTC 


AAG 


CGT TTG 


TGC 


GCC 


TCG 


ACG 


96 


TGG 


CCA 


GTC 


TGC 


ACT 


GCC 


TTG 


AAC 


CCG 


CGA 


GTC TCC 


CGC 


CGC 


ACT 


GAC 


144 


TGC 


CAT 


AGC 


ACA 


GCT 


AGA 


CGA 


ATG 


CAG 


CTA 


GCA GCG 


ACA 


GTA 


ATG 


TTG 


192 
















Met 


Gin 


Leu 


Ala Ala 


Thr 


Val 


Met 


Leu 




GAG 


CAG 


CTT 


ACC 


GGA 


AGC 


GCT 


GAG 


GCA 


CTC 


5 

AAG GAG 


AAG 


GAG 


AAG 


GAG 


240 


Glu 


Gin 


Leu 


Thr 


Gly 


Ser 


Ala 


Glu 


Ala 


Leu Lys Glu 


Lys 


Glu 


Lys 


Glu 




10 










15 










20 








25 




GTT 


GCA 


GGC 


AGC 


TCT 


GAC 


GTG 


TTG 


CGT 


ACA 


TGG GCG 


ACC 


CAG 


TAC 


TCG 


288 


Val 


Ala Gly Ser Sen 


Asp Val 


Leu 


Arg 


Thr 


Trp Ala 


Thr 


Gin 


Tyr 


Ser 












30 










35 








40 






CTT 


CCG 


TCA 


GAA 


GAG 


TCA 


GAC 


GCG 


GCC 


CGC 


CCG GGA 


CTG 


AAG 


AAT 


GCC 


336 


Leu 


Pro 


Ser 


Glu 


Glu 


Ser Asp Ala 


Ala 


Arg Pro Gly 


Leu 


Lys 


Asn 


Ala 










45 










50 








55 








TAC 


AAG 


CCA 


CCA 


CCT 


TCC 


GAC 


ACA 


AAG 


GGC 


ATC ACA 


ATG 


GCG 


CTA 


CGT 


364 


Tyr 


Lys 


Pro 


Pro 


Pro 


Ser Asp Thr 


Lys 


Gly 


lie Thr 


Met 


Ala 


Leu Arg 








60 










65 








70 










GTC 


ATC 


GGC 


TCC 


TGG 


GCC 


GCA 


GTG 


TTC 


CTC 


CAC GCC 


ATT 


TTT 


CAA 


ATC 


432 


Val 


I le 


Gly Ser Trp 


Ala 


Ala 


Val 


Phe 


Leu 


His Ala 


lie 


Phe 


Gin 


lie 






75 










80 








85 












AAG 


CTT 


CCG 


ACC 


TCC 


TTG 


GAC 


CAG 


CTG 


CAC 


TGG CTG 


CCC 


GTG 


TCA 


GAT 


480 


Lys 


Leu 


Pro 


Thr 


Ser 


Leu Asp Gin 


Leu 


His 


Trp Leu 


Pro 


Val 


Ser Asp 




90 










95 










100 








105 




GCC 


ACA 


GCT 


CAG 


CTG 


GTT 


AGC 


GGC 


ACG 


AGC 


AGC CTG 


CTC 


GAC 


ATC 


GTC 


528 


Ala 


Thr 


Ala 


Gin 


Leu 


Val 


Ser Gly 


Thr 


Ser 


Ser Leu 


Leu 


Asp 


He Val 












110 










115 








120 






GTA 


GTA 


TTC 


TTT 


GTC 


CTG 


GAG 


TTC 


CTG 


TAC 


ACA GGC 


CTT 


TTT 


ATC 


ACC 


576 


Val 


Val 


Phe 


Phe 


Val 


Leu 


Glu 


Phe 


Leu 


Tyr Thr Gly 


Leu 


Phe 


He 


Thr 










125 










130 








135 








ACG 


CAT 


GAT 


GCT 


ATG 


CAT 


GGC 


ACC 


ATC 


GCC 


ATG AGA 


AAC 


AGG 


CAG 


CTT 


624 


Thr 


His 


Asp Ala Met 


His 


Gly Thr 


He 


Ala Met Arg 


Asn 


Arg 


Gin 


Leu 





145 150 
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AAT 


GAC 


TTC 


TTG 


GGC 


AGA 


GTA 


TGC ATC 


TCC 


TTG 


TAC 


GCC 


TGG 


TTT 


GAT 


672 


Asn 


Asp Phe 


Leu Gly Arg 


Val 


Cys lie 


Ser 


Leu Tyr Ala 


Trp Phe Asp 






155 










160 








165 












TAC 


AAC 


ATG 


CTG 


CAC 


CGC 


AAG 


CAT TGG 


GAG 


CAC 


CAC 


AAC 


CAC 


ACT 


GGC 


720 


Tyr 


Asn 


Met 


Leu 


His 


Arg 


Lys 


His Trp Glu 


His 


His 


Asn 


His 


Thr 


Gly 




170 










175 








180 










185 




GAG 


GTG 


GGC 


AAG 


GAC 


CCT 


GAC 


TTC CAC 


AGG 


GGA 


AAC 


CCT 


GGC 


ATT 


GTG 


768 


Glu 


Vat 


Gly 


Lys Asp Pro Asp Phe His Arg 


Gly Asn Pro Gly 


lie 


Val 












190 








195 










200 






CCC 


TGG 


TTT 


GCC 


AGC 


TTC 


ATG 


TCC AGC 


TAC 


ATG 


TCG 


ATG 


TGG 


CAG 


TTT 


816 


Pro 


Trp Phe 


Ala 


Ser 


Phe 


Met 


Ser Ser 


Tyr 


Met 


Ser 


Met 


Trp Gin Phe 










205 








210 










215 








GCG 


CGC 


CTC 


GCA 


TGG 


TGG 


ACG 


GTG GTC 


ATG 


CAG 


CTG 


CTG 


GGT 


GCG 


CCA . 


864 


Ala 


Arg 


Leu 


Ala Trp Trp Thr Val Val 


Met 


Gin Leu Leu Gly Ala Pro 








220 










225 








230 










ATG 


GCG 


AAC 


CTG 


CTG 


GTG 


TTC 


ATG GCG 


GCC 


GCG 


CCC 


ATC 


CTG 


TCC 


GCC 


912 


Met 


Ala Asn 


Leu 


Leu Val 


Phe 


Met Ala 


Ala 


Ala 


Pro 


I le 


Leu 


Ser 


Ala 






235 










240 








245 












TTC 


CGC 


TTG 


TTC 


TAC 


TTT 


GGC 


ACG TAC 


ATG 


CCC 


CAC 


AAG 


CCT 


GAG 


CCT 


960 


Phe 


Arg 


Leu 


Phe 


Tyr Phe Gly Thr Tyr Met 


Pro 


His 


Lys 


Pro 


Glu 


Pro 




250 










255 








260 










265 




GGC 


GCC 


GCG 


TCA 


GGC 


TCT 


TCA 


CCA GCC 


GTC 


ATG 


AAC 


TGG 


TGG 


AAG 


TCG 


1008 


Gly 


Ala 


Ala 


Ser Gly Ser Ser Pro Ala Val 


Met 


Asn Trp Trp Lys Ser 












270 








275 










280 






CGC 


ACT 


AGC 


CAG 


GCG 


TCC 


GAC 


CTG GTC 


AGC 


TTT 


CTG 


ACC 


TGC 


TAC 


CAC 


1056 


Arg 


Thr 


Ser 


Gln Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His
285 290 295

TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG 1104
Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp
300 305 310

GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC 1152
Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala
315 320 325

TAG CTG GAC ACA CTG CAG TGG GCC CTG CTG CCA GCT GGG CAT GCA GGT 1200

TGT GGC AGG ACT GGG TGA GGT GAA AAG CTG CAG GCG CTG CTG CCG GAC 1248

ACG CTG CAT GGG CTA CCC TGT GTA GCT GCC GCC ACT AGG GGA GGG GGT 1296

TTG TAG CTG TCG AGC TTG CCC CAT GGA TGA AGC TGT GTA GTG GTG CAG 1344

GGA GTA CAC CCA CAG GCC AAC ACC CTT GCA GGA GAT GTC TTG CGT CGG 1392

GAG GAG TGT TGG GCA GTG TAG ATG CTA TGA TTG TAT CTT AAT GCT GAA 1440

GCC TTT AGG GGA GCG ACA CTT AGT GCT GGG CAG GCA ACG CCC TGC AAG 1488

GTG CAG GCA CAA GCT AGG CTG GAC GAG GAC TCG GTG GCA GGC AGG TGA 1536

AGA GGT GCG GGA GGG TGG TGC CAC ACC CAC TGG GCA AGA CCA TGC TGC 1584

AAT OCT GGC GGT GTG GCA GTG AGA GCT GCG TGA TTA ACT GGG CTA TGG 1632

ATT GTT TGA GCA GTC TCA CTT ATT CTT TGA TAT AGA TAC TGG TCA GGC 1680

AGG TCA GGA GAG TGA GTA TGA ACA AGT TGA GAG GTG GTG CGC TGC CCC 1728

TGC GCT TAT GAA GCT GTA ACA ATA AAG TGG TTC 1771
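
The correspondence between the codon rows and the amino acid rows in the listing above can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `translate`, `coding_length`, `ROW_ENDING_1104` and `ROW_ENDING_1152` are illustrative and do not appear in the application. It translates the two coding rows that end at nucleotides 1104 and 1152 and confirms that a coding region spanning nucleotides 166 through 1152 corresponds to 329 codons.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}

# Codon rows copied from the listing (rows ending at nt 1104 and nt 1152).
ROW_ENDING_1104 = "TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG"
ROW_ENDING_1152 = "GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC"


def translate(codons: str) -> list[str]:
    """Translate whitespace-separated DNA codons into three-letter codes."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons.split()]


def coding_length(first: int, last: int) -> int:
    """Number of codons in an inclusive nucleotide range."""
    span = last - first + 1
    if span % 3:
        raise ValueError("coding span is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    print(" ".join(translate(ROW_ENDING_1104)))
    print(" ".join(translate(ROW_ENDING_1152)))
    print(translate("TAG"))          # codon immediately following nt 1152
    print(coding_length(166, 1152))  # codons between nt 166 and nt 1152
```

Under these assumptions the two translated rows reproduce residues 298 through 329 as printed, and the codon following nucleotide 1152 is a stop codon, so the remaining rows of the listing carry no amino acid line.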



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids

(B) TYPE: amino acid

(C) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:



Met Gln Leu Ala Ala Thr Val Met Leu
1 5

Glu Gln Leu Thr Gly Ser Ala Glu Ala Leu Lys Glu Lys Glu Lys Glu
10 15 20 25

Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp Ala Thr Gln Tyr Ser
30 35 40

Leu Pro Ser Glu Glu Ser Asp Ala Ala Arg Pro Gly Leu Lys Asn Ala
45 50 55

Tyr Lys Pro Pro Pro Ser Asp Thr Lys Gly Ile Thr Met Ala Leu Arg
60 65 70

Val Ile Gly Ser Trp Ala Ala Val Phe Leu His Ala Ile Phe Gln Ile
75 80 85

Lys Leu Pro Thr Ser Leu Asp Gln Leu His Trp Leu Pro Val Ser Asp
90 95 100 105

Ala Thr Ala Gln Leu Val Ser Gly Thr Ser Ser Leu Leu Asp Ile Val
110 115 120

Val Val Phe Phe Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe Ile Thr
125 130 135

Thr His Asp Ala Met His Gly Thr Ile Ala Met Arg Asn Arg Gln Leu
140 145 150

Asn Asp Phe Leu Gly Arg Val Cys Ile Ser Leu Tyr Ala Trp Phe Asp
155 160 165

Tyr Asn Met Leu His Arg Lys His Trp Glu His His Asn His Thr Gly
170 175 180 185

Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly Asn Pro Gly Ile Val
190 195 200

Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Met Trp Gln Phe
205 210 215

Ala Arg Leu Ala Trp Trp Thr Val Val Met Gln Leu Leu Gly Ala Pro
220 225 230

Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro Ile Leu Ser Ala
235 240 245

Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro His Lys Pro Glu Pro
250 255 260 265

Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met Asn Trp Trp Lys Ser
270 275 280

Arg Thr Ser Gln Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His
285 290 295

Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp
300 305 310

Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala
315 320 325

WHAT IS CLAIMED IS:

1. A DNA segment comprising a nucleotide sequence coding for a
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4.

2. A DNA segment as in claim 1, wherein said nucleotide sequence is a 
variant selected from the group of variants consisting of allelic variants, species 
variants, naturally occurring variants, man-induced variants and combinations 
thereof.

3. A DNA segment as in claim 1, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO:1.

4. A DNA segment as in claim 1, wherein said nucleotide sequence
includes a sequence as set forth between and including nucleotides 166 and 1152
of SEQ ID NO:1.

5. A DNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:1.

6. A DNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:1.

7. A DNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:2. 

8. A DNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:2.

9. An RNA segment comprising a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4.

10. An RNA segment as in claim 9, wherein said nucleotide sequence is
a variant selected from the group of variants consisting of allelic variants, species
variants, naturally occurring variants, man-induced variants and combinations
thereof.

11. An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO:2. 

12. An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth between and including nucleotides 166 and 1152
of SEQ ID NO:2.

13. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:1.

14. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:1.

15. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:2. 

16. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:2.

17. A polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene, allelic and species variants, and functional 
naturally occurring and man-induced variants thereof.

18. A polypeptide as in claim 17, wherein said amino acid sequence is as
set forth in SEQ ID NO:4. 

19. A polypeptide comprising an amino acid sequence homologous to 
the sequence set forth in SEQ ID NO:4. 

20. A polypeptide comprising an amino acid sequence being encoded by
a DNA segment, said DNA segment hybridizing under low stringency conditions
to nucleotides 166 through 1152 of SEQ ID NO:1, the polypeptide having a
β-C-4-oxygenase activity.

21. A recombinant vector DNA molecule comprising a DNA segment as 
in claim 2. 



22. A host comprising a recombinant vector DNA molecule as in claim 
21, said host is selected from the group consisting of a cell and an organism. 

23. A host comprising a DNA segment as in claim 2, said host is 
selected from the group consisting of a cell and an organism. 

24. A method of producing xanthophylls selected from the group 
consisting of astaxanthin, canthaxanthin, echinenone, cryptoxanthin, 
isocryptoxanthin, hydroxyechinenone, zeaxanthin, adonirubin or adonixanthin and
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 22; 

(b) providing said host with growing conditions for production of the 
xanthophylls; and 

(c) extracting the xanthophylls from said host. 

25. A method of producing xanthophylls selected from the group 
consisting of astaxanthin, canthaxanthin, echinenone, isocryptoxanthin,
cryptoxanthin, hydroxyechinenone, zeaxanthin, adonirubin or adonixanthin and 
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 23; 

(b) providing said host with growing conditions for production of the 
xanthophylls; and 

(c) extracting the xanthophylls from said host. 



26. A host as in claim 22, wherein said host is used as a food additive. 



27. A host as in claim 23, wherein said host is used as a food additive. 



28. A transgenic plant expressing a transgene coding for a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring or man-induced 
variants thereof.

29. A transgenic plant as in claim 28, wherein said expression is highest
in chromoplasts-containing tissues.

30. A recombinant DNA vector comprising a first DNA segment 
encoding a polypeptide for directing a protein into plant chloroplasts or
chromoplasts and an in frame second DNA segment encoding a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring and
man-induced variants thereof.

31. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato. 

32. A recombinant DNA vector comprising a first DNA segment 
including a promoter highly expressible in plant chloroplasts or chromoplasts- 
containing tissues and a second DNA segment encoding a polypeptide including an 
amino acid sequence corresponding to Haematococcus pluvialis crtO gene, allelic 
and species variants or functional naturally occurring and man-induced variants
thereof. 

33. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato.

FIG. 2-A

FIG. 2-B

FIG. 3

Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely: 



2. Claims Nos.: 1, 3-5, 7, 9, 11-13, 15 and 18

because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:

The claims are drawn to specific nucleic acid and amino acid sequences of SEQ ID NO: 2 and SEQ ID NO: 4, 
respectively. Applicants have not provided the sequences in computer readable form to search the sequences in 
commercial data bases. Thus, the claims could not be meaningfully searched. 

3. Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

This International Searching Authority found multiple inventions in this international application, as follows:
Please See Extra Sheet.



2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.:



4. No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:



Remark on Protest

The additional search fees were accompanied by the applicant's protest.

No protest accompanied the payment of additional search fees.



Form PCT/ISA/210 (continuation of first sheet(1)) (July 1992)

INTERNATIONAL SEARCH REPORT

International application No.
PCT/US97/17819



B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used):

APS, STN: Medline, Caplus, Scisearch, Lifesci, Biosis, Embase, Wpids, Agricola, and Biotechds. Search terms:
haematococcus, pluvialis, DNA, cDNA, sequence, astaxanthin, canthaxanthin, echinenone, isocryptoxanthin,
cryptoxanthin, hydroxyechinenone, zeaxanthin, adonirubin, adonixanthin and crtO.

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING

This application contains the following inventions or groups of inventions which are not so linked as to form a single
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search
fees must be paid.

Group I, claims 2, 6, 8, 10, 14, 16, 17, and 19-27, drawn to DNA coding for an oxidoreductase which is involved in
the biosynthesis of astaxanthin, the enzyme of SEQ ID NO: 4, vector, host cell and a method of making xanthophylls.

Group II, claims 28-33, drawn to transgenic plants and a special vector for transforming plants.

The inventions listed as Groups I and II do not relate to a single inventive concept under PCT Rule 13.1 because, under
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: The special
technical feature of the invention of Group I is the DNA coding for the oxidoreductase of SEQ ID NO: 4. Group I
encompasses the DNA, the enzyme, vector containing the said DNA, host cell and a single use for the DNA which is the
method of producing xanthophylls. The invention of Group II is drawn to a transgenic plant containing the DNA of
Group I and vectors useful for transforming plants. Thus, the invention of Group II represents a second use for the
DNA. Thus, the inventions of Groups I and II are not so linked with a special technical feature as to form a single
inventive concept under PCT Rule 13.1, see 37 C.F.R. 1.475(d).